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ABSTRACT 


The  need  for  effective  methods  in  document 
searches  is  first  examined,  and  then  four  optimum 
criteria  are  presented.  Optimization  is  accomplished 
by  finding  a  set  of  optimum  weights  for  the  keywords 
in  the  question.  Each  of  the  four  optimum  procedures 
is  discussed  in  some  detail.  Examples  are  also 
presented  for  illustration  and  comparison.  The  least 
square  criterion  seems  to  be  a  promising  optimum 
procedure . 
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CHAPTER  1 


INTRODUCTION 

1.1  Need  for  Optimization 

Information  retrieval  is  a  field  concerned  with 
the  problems  of  organization  and  storage,  and  file 
structure,  as  well  as  retrieval  techniques.  With  the 
advent  of  computers  a  considerable  amount  of  attention 
has  been  focused  on  the  development  of  information 
retrieval  systems  in  which  the  computer  plays  a  basic 
role.  The  computer  not  only  permits  mechanized  handling 
of  data  by  existing  methods,  but  it  also  opens  up  paths 
for  new  approaches  which  are  otherwise  not  possible. 

In  this  respect,  a  few  experimental  interactive  systems 
have  been  developed  with  varying  degrees  of  success 
[1  to  4 ]  . 

The  majority  of  interactive  systems  operate  in  a 
real  time,  man-machine  dialogue,  mode  in  which  the 
system  accepts  requests  either  in  precoded  form  or 
in  natural  language.  In  the  analysis  phase  the  request 
is  analyzed  and  correlated  to  documents  in  storage. 
Highly  correlated  documents  are  produced  as  hits.  In 
some  cases,  such  as  in  the  SMART  system  [4] ,  the  user 
may  be  able  to  control  the  correlation  measures  by 
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specifying  weights  in  the  request.  Depending  on  whether 
the  user  is  satisfied  with  the  system's  initial  response, 
he  may  reformulate  the  request  in  order  to  obtain  an 
improved  response.  To  do  this  some  way  of  evaluating 
system  performance  is  needed. 

The  system  may  be  considered  to  be  efficient  if 
the  response  produces  highly  useful  items  in  terms  of 
relevance  and  quantity.  It  is  thus  possible  to  define 
two  ratios  "Recall"  and  "Precision"  for  the  evaluation 
of  a  system's  performance.  Recall  is  the  proportion  of 
relevant  items  in  storage  that  are  successfully 
retrieved,  while  Precision  is  the  proportion  of  retrieved 
documents  that  are  actually  relevant. 

Although  high  Recall  and  Precision  values  may  be 
chosen  as  the  measure  of  a  good  response,  it  requires 
a  manual  examination  of  the  collection  in  storage  to 
determine  the  total  number  of  relevant  items  for  each 
gly-gj}  request.  While  this  method  of  evaluation  may  not 
cause  difficulties  in  small  experimental  systems,  it 
is  completely  unsatisfactory  in  normal  environments 
where  very  large  data  collections  are  likely  to  occur. 

To  overcome  this  shortcoming  Salton  [5]  has  suggested 
sampling  techniques,  or  the  use  of  approximate  Recall 
values  based  upon  search  requests  on  limited  source 
documents  in  the  collection.  In  spite  of  his  suggestions 
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it  is  felt  that  Recall  and  Precision  measurements  will 
be  regarded  as  idealized  theoretical  concepts,  and  are 
not  useful  tools  in  practice. 

A  more  direct  approach  to  the  prediction  of 
system  efficiency  is  to  find  a  set  of  optimum  weights 
for  each  request.  Through  this  optimization  it  is 
intended  to  restrict  the  system  response  to  the  most 
relevant  items  for  that  request. 

Barkle  [ 6  ]  has  discussed  a  method  of  arriving 
at  the  optimum  weights  in  terms  of  a  probability  ratio, 
called  the  specificity,  which  relates  the  frequency  of 
occurrence  of  a  keyword  in  relevant  items  to  the 
frequency  of  occurrence  of  it  in  the  whole  collection 
to  be  searched.  The  weights  are  then  made  proportional 
to  the  specificities.  It  is  expected  that  the  higher 
the  specificity,  the  greater  is  the  probability  that 
an  item  containing  the  keyword  will  be  relevant. 

1.2  Analogy  in  filter  theory 

An  alternative  approach  to  the  problem  of  obtain¬ 
ing  optimum  weights  may  be  made  through  the  use  of 
filtering  techniques.  Suppose  a  signal  f  (t)  is  received 
and  it  contains  a  message  g  (t) .  It  is  required  that 
g (t)  should  be  extracted  from  f (t)  with  as  little 
distortion  as  possible.  One  method  is  to  pass  f (t) 
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through  a  filter  designed  to  minimize  the  root— mean- 
square  error  of  the  noise  which  is  f(t)-g(t).  This 
results  in  the  well  known  Wiener  filter  [7,8],  An 
alternative  method  is  to  maximize  the  ratio  between 
the  peak  amplitude  of  the  signal  and  the  root-mean- 
square  of  the  noise  [9] ,  this  will  be  referred  to  as 
the  matched  procedure.  In  either  case,  realizable 
filters  can  be  constructed  to  suppress  the  noise 
background  as  far  as  possible  so  that  the  desired 
message  can  be  detected. 

In  like  manner,  an  information  retrieval  system 
can  be  designed  with  the  aim  of  giving  the  best  possible 
response  to  a  request.  In  the  present  study  the  corres¬ 
ponding  theory  of  filters  will  be  examined  very  briefly 
and  extended  to  lead  to  the  design  of  an  optimal 
retrieval  system.  The  optimum  response  can  then  be 
defined  as  the  best  subset  of  documents  retrieved  with 
minimum  distortion,  in  the  sense  that  the  Recall  and 
Precision  values  are  close  to  unity. 

1.3  A  simple  model 

In  its  simplest  form,  a  retrieval  system  may  be 
considered  to  include  a  document  processor  as  shown  in 
Fig.  1.1.  The  document  processor  can  be  regarded  as  a 
device  with  parameters  W  as  determined  by  each  input 
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request.  Output  response  will  be  obtained  after  each 
document  Z^(n)  has  been  processed. 


Request 


Z  (n) 


W 


document 

processor 


-  r  (n) 


Fig.  1.1  Document  processor  in  a  simple  system 


Suppose  a  request  for  information  is  converted  to 
a  specification  of  process  parameters  W  which  are  then 
used  to  process  each  document  Z_(n)  ,  the  choice  of  W  is 
to  be  made  automatically  according  to  some  optimizing 
procedure  as  shown  in  Fig.  1.2,  and  the  processing  of 
each  document  is  also  to  be  made  automatic.  Thus,  given 
a  retrieval  criterion,  each  request  determines  an  optimum 
set  of  weights  W  which  are  used  to  process  each  document 
in  the  entire  collection.  It  is  evident  that  different 
requests  determine  different  sets  of  weights. 

A  threshold  value  may  be  chosen  so  that  the 
response  of  each  document  may  be  compared  with  it  and 
a  ratio  t  obtained.  Each  value  of  t  is  considered  to 
be  the  system's  estimate  on  the  relevancy  of  the  document 
to  the  request. 
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Fig.  1.2  Automatic  retrieval  system  with  optimization 

1.4  Form  of  question  and  data  base 

The  user's  request  may  be  in  one  of  several  forms. 

It  may  specify  a  list  of  vectors  x  which  represent  actual 
titles  obtained  from  a  previous  search,  or  partial  search, 
and  which  are  found  relevant  to  the  question.  Alterna¬ 
tively  the  vectors  may  represent  highly  relevant  keywords 
chosen  for  the  question.  A  mixture  of  these  two  forms  of 
request  may  also  form  a  valid  question. 

It  will  be  assumed  that  the  question  consists  of 
keywords  coded  in  truncated  form,  the  use  of  truncation 
having  proved  to  be  useful  [9] .  In  this  form,  the 
question  can  be  considered  as  a  vector,  the  components 
of  which  are  the  weights  that  may  be  negative,  positive 
or  zero.  The  simplest  case  is  when  they  take  on  binary 
values.  Then  with  reference  to  a  chosen  set  of  para¬ 
meters  the  i^^1  element  will  be  1  if  a  keyword  in  the 
question  matches  the  i^  parameter  in  the  given  set  and 
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it  will  be  zero  if  the  keyword  does  not  match.  Documents 
may  be  represented  by  vectors  in  a  similar  manner,  and 
such  representation  proves  to  be  very  useful  for  mathema¬ 
tical  operations. 

The  data  base  used  is  the  ACOUS  tape  prepared  by 
the  Department  of  Computing  Science  at  the  University 
of  Alberta.  It  is  made  up  of  approximately  7,000 
entries,  each  of  which  refers  to  publications  that 
appeared  in  one  of  the  following  Journals  during  the 
period  1954-1969:  Journal  of  Acoustic  Society  of  America, 
Acoustica,  Soviet  Physics  (Acoustic) ,  and  Journal  of 
Fluid  Mechanics.  All  the  articles  are  in  the  fields  of 
acoustics,  fluid  mechanics,  or  related  areas.  It  is 
believed  that  this  data  base  is  large  enough  to  produce 
meaningful  results  for  the  present  investigation.  The 
choice  of  a  rather  specialized  area  instead  of  the  whole 
of  the  scientific  literature  permits  the  results  to  be 
analyzed  unambiguously  and  efficiently.  An  example  of 
the  form  of  the  references  on  the  ACOUS  tape  is  shown 
by  the  following  list: 

SOV  57  0003  0017ALMUK  PULSE  METHO  MEASU  NONUN  MOTIO  MAGNE 
TAPE 

ACO  57  0029  0654LYON  LOW  FREQU  RADIA  LOAD  BASS  REFLE 
SPEAK 

ACO  60  0032  0709ZANKE  DIFFR  LIGHT  TWO  ULTRA  WAVES  SAME 
FREQ 

ACU  59  0009  0 4 0 3CONNO  MODEL  AID  ACOUS  DESIG  AUDIT 
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The  first  three  letters  indicate  the  journal 
name,  followed  by  the  year.  The  next  four  digits 
give  the  volume  number  which  is  followed  by  the  page 
number.  Immediately  following  the  numerals  are 
author  name(s)  and  title  words.  These  are  truncated 
to  a  maximum  of  five  characters.  Although  truncated 
keywords  may  be  ambiguously  interpreted  in  some  cases, 
no  difficulties  have  been  encountered  in  the  interpre¬ 
tation  of  results  obtained  in  searches  on  the  ACOUS 
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CHAPTER  2 

MATCHED  SEARCH 

2.1  Criterion  of  maximum  average  response 

The  signal  detection  technique  employing  matched 
linear  filters  will  be  used  in  the  retrieval  problem 
presented  here.  The  search  for  relevant  items  will  be 
treated  similarly  to  the  detection  of  a  signal  within  a 
noise  background.  It  will  be  assumed  that  the  documents 
are  in  sufficiently  large  quantity  to  make  the  analogy 
plausible . 

As  mentioned  previously,  the  document  processor 

til 

is  linear  so  that  the  response  to  the  n  document  is 

r  (n)  =  E  w^  zi  (n)  (2.1) 

i 

where  the  w^  and  z^  (n)  respectively  denote  components  of 
the  vectors  W  and  Z_(n)  of  Fig.  1.1.  Thus  the  w^  describe 
the  request,  while  the  z^  (n)  describe  the  n-th  document. 

The  retrieval  criterion  is  that  r(n)  be  as  large 
as  possible  for  documents  relevant  to  the  request.  The 
measure  of  largeness  is  in  reference  to  the  average 
response  to  the  documents  of  the  whole  set. 

The  mean  square  response  of  the  document 

processor  to  the  whole  set  of  N  documents  is 
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(2.2) 


where 


(2.3) 


measure  between  the  i-th  and  j-th  keywords  in  each 
document . 

Without  loss  of  generality,  assume  the  request 
consists  of  M  vectors  x  representing  a  set  of  actual, 
or  hypothetical,  documents.  The  mean  response  to  x  is 


(2.4) 


where 


(2.5) 


Hence  the  vector-  X  denotes  the  average  of  the  M  input 
titles . 

The  set  of  weights  W  will  be  considered  as  chosen 


2  .  . 

optimally  if  the  response  r^  is  a  maximum  m  comparison 
to  r2.  Thus  for  each  weight  w . ,  the  following  equation 

TD  1 


m 


must  hold. 


0  ?  for  each  i 


(2.6) 
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or  x±  -  A  E  c  w.  =  0  (2.7)  * 

j  J  J 

2  2 

where  A  denotes  the  stationary  value  of  r  /r  . 

J  x7  m 

In  matrix  notation  equations  (2.2),  (2.4)  and  (2.7) 

can  be  expressed  as 


m 


wT  C  W 


r  =  X 

x  — T  — 


and  C  W  =  ^  — 


2 

The  stationary  value  of  (rx/r  )  is  then 


(WT  X)- 

wT  C  W 


=  X  T  c  X 


(2.8) 

(2.9) 

(2.10) 


(2.11) 


where  the  subscript  T  indicates  a  transpose  and  the 

superscript  -1  denotes  an  inverse  operation. 

With  the  set  of  w^  chosen  to  satisfy  equation 

(2.10)  the  processor  is  optimum  in  the  sense  that  for 

the  given  set  of  vectors  x,  it  gives  the  maximum  average 

response  r^  with  respect  to  the  root  mean  square  response 

r  .  Apart  from  a  constant  of  proportionality,  which 
m  ^ 


* 


Derivation  of  equation  (2.7)  appears  in  Appendix  C. 
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does  not  affect  the  value  of  r^/r^ ,  the  set  of  optimum 
weights  is  obtained  as 

W  =  C_1  X  .  (2.12) 

Thus  the  processor  response  to  any  document  Z (n)  is 
given  by 


With  the  value  of  the  threshold  level  chosen  as  the 
root-mean-square  response  r  ,  the  relevance  measure  t 
may  be  determined  as 


(2.14) 


A  cut-off  value  t  may  be  defined  as 


o 


m 


(2.15) 


so  that  for  each  document  processed,  a  comparison  may 
be  made  between  the  values  t  and  t  .  A  document  with 
a  value  of  t  considerably  smaller  than  tQ  will  be 
eliminated  as  being  non-relevant  to  the  request. 


2.2  Power-series  approximation 

It  appears  that  in  equation  (2.12)  document  pro¬ 
cessing  requires  a  matrix  inversion  to  determine  C  1, 
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a  process  in  which  computation  may  become  excessively 
cumbersome  when  large  matrices  are  encountered.  It 
is  possible,  however,  to  estimate  C  ^  by  an  approxima¬ 
tion  that  does  not  require  a  formal  matrix  inversion. 
Let 


C  =  D  +  N 


(2.16) 


where  D  is  a  diagonal  matrix  with  elements 


d  .  .  =  c  .  . 

lr  11 


(2.17) 


and  N  is  a  square  matrix  whose  elements  are 


n .  .  = 
ij 


c .  .  , 

il 


0 


i  ^  j 


(2.18) 


Equation  (2.13)  may  be  expressed  in  terms  of  D  and  N, 


r  =  X T  C  1  Z 


XT  (I  +  D  1  N)  1  D  1  Z 


(2.19) 


where  I  is  the  identity  matrix. 

In  terms  of  a  power  series  the  last  equation 

becomes 


r  =  X T  D  1  Z  -  X T  D  1  N  D  1  Z 


+xtd1nd1nd1z-+  ... 


(2.20) 
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If  C  is  nonsingular  the  series  may  converge,  and 
if,  in  addition,  the  diagonal  elements  of  C  are  con¬ 
siderably  larger  than  the  remaining  elements,  then  the 
series  will  converge  rapidly  and  the  high  order  terms 
may  be  neglected  without  serious  error.  It  may  be 
observed  that  for  a  reasonably  well  formulated  request, 
the  terms  c. .  are  larger  than  the  cross-correlated  terms 

c. ..  This  is  so  since  the  statistic  for  any  two  given 

ID 

keywords  to  be  in  the  same  title,  as  measured  by  c..  ^  , 
is  smaller  than  that  for  each  keyword  by  itself,  as 
measured  by  c.^.  Under  such  conditions  the  approximate 
expansion  may  be  used. 

The  first  order  approximation  of  the  response  r 
is  then 


r  =  E  ( 


x . 

i 

c .  . 

li 


x 


...  c .  . 
D  ^i  11 


c  .  . 

_DJi)  z 

c  .  .  : 

DD 


(2 . 21) 


The  mean  square  response  to  the  collection  of  documents 
can  also  be  expressed  approximately  as 


2  y  (Xi 

r  =  E  ( — — 

m  c .  . 

l  ii 


x  .  c  .  . 

-  E  —3—  -^-)  x . 

.  .  .  c  .  .  C  .  .  1 

D^1  11  DD 


(2.22) 


and  the  stationary  value  of  equation  (2.11)  becomes 


r  2 

(-£) 

r 

m 


x  .  x  .  c  .  . 

E  (=i-  -  E  Ji)  x, 

y/i-  ii  jj 


C  .  . 
11 


1 


1 


(2.23) 
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Consequently  t,  the  relevance  measure,  may  be  deter¬ 
mined  through  equations  (2.10)  and  (2.21). 

The  approximate  weight  for  the  i-th  keyword  may 
be  obtained  as 


x . 

l 


c  .  . 
11 


X  .  c  .  . 

E  —3—  .  — 
3^1  li  33 


(2.24) 


It  may  be  seen  that  the  contribution  from  the  second 

term  in  the  expression  is  small  in  comparison  to  the 

first  term  since  the  diagonal  terms  c. c. .  are  larger 

li  33 

than  the  cross-correlated  terms  c.  ..  Thus  the  weight 

13 

for  the  i-th  keyword  may  be  interpreted  as  being 
approximately  inversely  proportional  to  c.^,  the 
frequency  of  occurrence  of  that  word  in  the  data  base. 
This  is  to  be  expected  since  a  common  word  does  not 
carry  too  much  information  with  it  and  hence  has  a  low 
weight . 


2.3  Flow  Chart  of  Matched  Search 

The  search  procedure  based  on  the  above  analysis 
involves  two  separate  steps.  For  each  request  the 
entire  collection  is  initially  examined  for  the  values 
of  the  correlations  c^_.  and  the  mean  response  r^.  It 
may  be  observed  that  since  c^  is  a  quantity  dependent 
on  the  average  properties  of  the  whole  set,  a  search 
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through  the  entire  collection  will  usually  be  unnecessary 
and,  in  fact,  impractical,  for  collections  that  contain  a 
large  quantity  of  documents.  It  may  be  sufficient  to 
make  this  search  on  a  representative  subset  rather  than 
on  the  whole  set.  Alternatively,  sampling  strategy  may 
be  used  on  the  whole  set.  The  second  search  step  is 
then  initiated.  This  time  each  individual  item  is 
processed  for  matches  with  the  elements  in  X.  Values 
of  t  and  r  are  then  determined. 

The  search  procedure  may  be  summarized  as  follows: 

(a)  Elements  of  the  M  vectors  x  are  merged  and  arranged 
in  suitable  order  to  form  a  set  X. 

(b)  First  pass:  vectors  of  a  suitably  chosen  subset  of 
documents  are  compared  with  the  set  of  X  for  counts 
of  identical  keywords.  The  counts  are  required  for 
computation  of  c^  and  r^ . 

(c)  Second  pass:  computation  of  r  and  t  for  each  document 
processed . 

(d)  Each  response  is  examined  for  its  degree  of  relevance . 
Sorting  may  be  included. 

A  flow  chart  of  the  procedure  is  shown  in  Fig.  2.1. 


■ 
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Fig.  2.1.  Flow  Chart  of  Matched  Search 


■ 
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2.4  Application  of  matched  search 

The  search  procedure  as  outlined  above  corresponds 
to  a  one  parameter  form  of  question  in  which  the  only 
logic  used  is  "OR" .  Thus  a  question  containing  five 
keywords  A,  B ,  C,  D,  and  E  is  equivalent  to  the  speci¬ 
fication  : 

Profile:  AORBORCORDORE 

Suppose,  for  example,  that  a  request  is  made  for 
documents  related  to  the  noise  of  engines,  aircrafts 
and  fans,  and,  in  particular,  the  control  and  reduction 
of  noise  produced  by  them.  A  random  selection  from  some 
relevant  titles  gives  the  following  ten  titles: 

(1)  Cockpit  noise  in  small  aircraft, 

(2)  Noise  control  for  natural  gas  engine  installations, 

(3)  Accuracy  considerations  in  fan  sound  measurements, 

(4)  Effects  of  increased  inlet  guide  vane  rotor  spacing 
on  compressor  noise  reductions, 

(5)  Fans  and  pure  tones, 

(6)  Noise  control  of  on  board  auxiliary  power  units 
for  aircraft  use, 

Origin  and  characteristics  of  sound  generation  in 
compressors  and  fans, 


(7) 


■ 
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(8)  Pratt  and  Whitney  aircraft  experience  in  compressor 
noise  reduction, 

(9)  Reduction  of  compressor  noise  radiation, 

(10)  Reduction  of  turbomachinery  noise  by  absorption  and 
choked  flow. 

A  total  of  seven  keywords  were  picked  from  these 
titles  and  the  question  profile  was  as  follows: 

Profile:  Reduction  of  noise  from  engines,  fans  and  aircrafts. 

OR  AIRCRAFT 

OR  COMPRESS  (COMPRESSOR,  COMPRESSION,  ETC.) 

OR  CONTROL  (CONTROLLER) 

OR  FANS 

OR  ENGINE 

OR  NOISE 

OR  REDUCE  (REDUCTION) 

The  keywords  are  truncated  to  a  maximum  of  five 
characters  so  that  the  use  of  1 COMPR'  may  include  such 
words  as  compress,  compressor  and  compression,  among 
others.  Arguments  in  favour  of  such  word  truncations 
have  been  provided  by  Heaps  [10]  . 

The  search  was  made  on  the  ACOUS  tape  which 
produced  a  maximum  of  100  titles  in  decreasing  order 
of  their  relevance  (see  Appendix  b) .  The  following 
list,  retrieved  by  the  approximate  method. 


. 


is  a  subset  of  the  output  bearing  the  higher  relevant 
ratios.  Details  of  this  and  other  examples  will  be 
discussed  in  a  later  chapter. 
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Normalized 

Keywords  in 

Number 

of 

relevance 

title 

titles  retrieved 

1.000 

FANS, 

NOISE 

3 

.515 

REDUC , 

AIRCR,  NOISE 

1 

.  349 

NOISE, 

AIRCR 

8 

.  343 

AIRCR 

3 

.  202 

ENG IN, 

NOISE 

4 

.  196 

ENGIN 

5 

.  172 

NOISE, 

REDUC 

10 

34 

total 

Titles  further  down  in  the  list  become  less 
relevant  to  the  question.  This  is  because  the  majority 
of  these  contain  only  one  matched  keyword  in  each  title. 

A  title  with  only  the  word  'reduction'  may  not  bear  any 
significance  to  the  request  although  the  title  was 
included  in  the  list.  It  appears  then  that  the  subset, 

34  in  all,  may  be  regarded  as  the  desirable  answer  without 
much  loss  of  important  information.  The  cut-off  value 
for  this  question  may  be  considered  to  be  .172  (normalized). 
It  should  be  mentioned  that  although  cut-off  values  may 
be  chosen  automatically,  it  is  found  that  the  decision 


' 

. 
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to  retain  whatever  portion  of  the  relevant  items  in 
the  output  list  is  quite  arbitrary ,  and  therefore 
should  be  made  manually  after  the  search  is  completed. 

Results  as  obtained  by  the  power-series 
approximation  appear  quite  satisfactory.  It  remains 
to  be  shown  that  the  approximate  response  of  equation 
(2.21)  is  acceptable  in  terms  of  output  response 
accuracy.  The  exact  response  of  equation  (2.13)  is 
readily  obtainable  in  this  example  for  direct  compari¬ 
son.  Results  are  listed  below. 

Normalized  Keywords  in  Number  of 

relevance  title  titles  retrieved 


1.000 

NOISE,  FANS 

3 

.535 

REDUC,  AIRCR,  NOISE 

1 

.  362 

NOISE,  AIRCR 

8 

.  356 

AIRCR 

3 

.205 

ENGIN,  NOISE 

4 

.  198 

ENG  IN 

5 

.178 

NOISE,  REDUC 

10 

34  total 

As  can  be  seen  by  the  comparison  of  the  two  out¬ 
puts  ,  agreements  are  excellent,  exactly  the  same  number 
of  titles  were  captured  and  each  in  idential  order. 

The  approximate  method  results  in  a  slightly  lower  value 
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in  relevance.  This  is  to  be  expected  since  only  the 
first  two  terms  of  the  series  were  retained  during 
the  analysis. 

A  questioner  may  thus  use  the  approximation  with 
confidence  that  it  will  produce  almost  as  good  results 
as  obtained  by  the  exact  method.  The  approximation 
has  the  additional  advantage  of  being  faster  and  more 
economical  in  terms  of  computer  time  and  storage.  When 
the  available  amount  of  core  space  is  limited,  as  would 
be  the  case  in  some  installations,  or  if  the  question 
is  so  complex  that  round-off  errors  caused  by  matrix 
inversion  become  significant,  then  the  approximation 
is  distinctly  preferable. 

2.5  Effect  of  additional  cross-correlated  terms 

In  the  above  analysis  the  optimal  weights  were 
based  on  the  statistical  distribution  of  each  term  alone, 
the  probable  contribution  from  cross-correlated  terms 
was  not  discussed.  It  may  be  argued  that  the  computa¬ 
tion  of  optimum  weights  should  also  be  dependent,  in 
part,  on  the  frequency  of  occurrence  of  a  given  pair 
of  keywords.  It  is  expected,  however,  that  whatever 
this  contribution  may  be,  it  will  only  be  of  secondary 
importance . 


•  ! 
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It  is  thus  possible  to  define  a  secondary  set  of 
weights  which  specifies  the  extent  of  cross-correlation 
between  pairs  of  keywords.  In  a  question  containing 
the  words  A,  B,  C  and  D,  the  primary  weights  will  be 
denoted  by 

A  A  A  A 

W  ,  W  ,  w  ,  w 
AA'  BB  CC'  DD 


and  the  secondary  weights  by 


WAB'  WAC '  WAD '  WBA '  WBC '  et°* 

More  formally  let  the  response  to  the  n^  document  be 


r  (n)  =  E  w.  .  z .  .  (n)  +  E  w .  .  z .  (n)  z  .  (n) 

•  1111  •  /  •  11  1  I 

i 


E  E  w. .  z . . (n) 
.  .  in  n 

i  3 


(2.24) 


where  w. .  is  the  weiqht  associated  with  the  i ^  and  the 
il 
f  h 

j  keywords  in  the  same  document  and 

if  i  =  j 


z  .  .  = 
il 


z  . 
1 


(2.25) 


kz  .  z  t 

i  1 


i  1  j 


It  is  seen  that  z.  .  is  identical  to  z. .  ,  so  that  only 

13  1 

half  of  the  contribution  from  each  term  may  be  taken 


for  the  summation. 


■ 


■ 
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The  mean  response  to  the  M  hypothetical  documents 
will  be 


r  =  —  £  £  w ..Ex.. (m) 
x  M  .  .  i]  i] 

13  J  m  J 


=  E  E  w .  .  y .  . 
...  13  Ji] 

1  1 


(2.26) 


where 


r .  .  =  —  E  x .  .  (m) 
in  M  i] 

J  m  J 


(2.27) 


and  x. .  is  specified  by  a  relation  similar  to  z. .  as 

il  il 

expressed  in  equation  (2.25). 

The  mean  square  response  to  an  item  in  a  collection 
of  N  documents  will  be 


-2  1  „  ~2  ,  . 
rm  ='  N  E  r  (n) 


/\  /N 


=  N  S  E  ^Wij  ”k£  Zij(n)  Zk£(n) 
n  1  1  k  £  J 


/\  /N 


Z  E  ^°ijk  "ij  Wk£ 
1  1  k  £  J 


(2.28) 


where 


:ijk£  =  N  l  Zij(n)  Zk£(n) 


(2.29) 


2  ,  2 


Maximizing  the  ratio  rx/rm  with  respect  to  w^^ 
results  in  the  set  of  equations 


-  - 


■ 
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A  £  £  c 
k  £ 


ijk 


(2.30) 


for  each  w .  .  . 

13 


Let  a  sequence  {y  }  ,  r=l,2/.../n  be  defined 


to  represent  the  set  Y-^/  Yp2 ' 


nl'  *  *  *  '  ynn' 


XX  ^ 

and  the  sequence  {w  }  ,  s=lf2,...,  n  be  defined  to 

s 

represent  w-^ ,  wny/  •••r  Then  for  each 

r,  equation  (2.30)  becomes 


n 

y  -  A  £  c  w  =  0 
Jr  ,  rs  s 

s=l 


(2.31) 


where 


;  =  -  E  zr(n)  zs(n) 

n 


(2.32) 


Hence  the  optimum  weights  w  can  be  expressed  in  the 
following  matrix  equation 


W  =  ic-b 


(2.33) 


and  the  processor  response  becomes 


r  (n)  = 


wT  Z 


Y  C  1  Z 


(2.34) 


The  approximate  response  can  be  obtained  in  the  form 


r(n)  =  Y,r  [D(  I  +  D  1  N)]  1  Z 


=  i  (y 


y  c 
S  sr . 

y  -  .  - )  z 

L  c  r 


r  C  Q 

rr  s^r  rr 


ss 


(2.35) 


— 


1 
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Careful  consideration  of  equation  (2.32)  reveals  that 
the  diagonal  elements  of  C  may,  or  may  not,  be  larger 
than  the  other  elements.  Furthermore  some  diagonal 
elements  may  even  be  zero.  Suppose  a  question  consists 
of  four  keywords  A,  B,  C  and  D,  then  the  sequence 
{z^}^  may  be  defined  as  the  different  possible  com¬ 
binations  of  the  four  keywords.  Thus,  designating 
A,  B,  C,  D  as  1,  2,  3,  4  respectively 

Z1  =  zl,l 
Z2  =  Zl,2 

Z12  =  Z3,4 

2 

In  general  n  keywords  form  a  total  of  n  combinations 
and  the  index  r  is  related  to  the  indexes  i  and  j  by 
the  expression 

r  =  (i-1)  n  +  j  .  (3.36) 

Consider  the  case  where  keywords  B,  D  never  occur 

together  in  any  title  of  the  collection,  then  zg  and  Z14 

will  be  zero.  Under  such  circumstance  the  response 

r (n)  in  equation  (2.35)  will  be  unbounded  since  cg^g 

and  c,  ,  .  are  both  zero.  It  is  therefore  essential  that 
4  f  J-  4 

all  possible  combinations  of  the  keywords  in  the  question 
exist  in  the  data  set. 


. 
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The  requirement  of  such  pairing  of  keywords  is 

a  rather  serious  restriction  to  a  user  when  formulating 

a  question.  It  is  difficult,  if  not  impossible,  to 

predict  and  use  only  those  keywords  that  form  pairs  in 

the  data  base.  Furthermore  even  if  such  a  set  is  known, 

the  pairs  may  not  be  all  included  in  the  question. 

This  is  because  a  set  of  n  keywords  will  lead  to  a 

2  2 

matrix  of  size  n  x  n  which  in  most  cases  is  not 
desirable  in  terms  of  the  amount  of  computer  core 
storage  required  and  the  length  of  computation  time 
unless  n  is  sufficiently  small. 

The  response  to  the  same  search  question  as  in 
section  2.4  is  summarized  below. 


Normalized 

relevance 

Keywords 
in  title 

Number  of 
titles  retrieved 

1.000 

REDUC,  AIRCR, 

NOISE  1 

0.407 

NOISE,  FANS 

3 

0.378 

NOISE,  AIRCR 

8 

0.335 

NOISE,  REDUC 

10 

0.310 

'  ENGIN,  NOISE 

4 

0.310 

NOISE,  CONTR 

7 

0.102 

AIRCR 

3 

36  total 

It  can  be  seen  that  a  slightly  improved  result 
over  the  previous  two  cases  is  obtained  in  that  two  more 


— 


I 


' 
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were  obtained.  It  must  be  mentioned  however , 
that  there  are  some  zero  diagonal  elements  in  the 
matrix  C.  In  view  of  this,  the  approximation  approach 
was  used  which  permitted  the  elimination  of  all  those 
terms  whose  combinations  do  not  exist  in  the  data  base. 
Because  of  this,  the  relevance  figures  quoted  must  be 
regarded  as  rough  estimates  only  and  not  be  consi¬ 
dered  as  exact. 
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CHAPTER  3 

LEAST  SQUARE  SEARCH 

3.1  The  search  criterion 

The  previous  chapter  contained  a  description  of 
use  of  the  matched  search  to  maximize  a  certain  res¬ 
ponse  with  respect  to  the  mean  response  to  all 
documents.  The  method  is  based  on  the  assumption 
that  a  'noisy'  background  tends  to  smooth  out  the 
average  properties  of  the  data.  The  average  total 
response  of  the  collection  may  then  be  taken  to  be 
that  for  non-relevant  titles.  Consequently,,  a  title 
whose  response  matches  the  question  is  considered  to 
be  a  hit. 

It  is  observed  that  the  weights  as  indicated  in 
equation  (2.1.2)  depend  as  much  on  the  average  response 
of  all  documents  (as  described  by  C)  as  on  the  form  of 
the  question  vector  X.  While  the  user  is  able  to 
change  the  keywords  in  the  question,  he  is  unable 
to  give  his  estimate  of  the  relative  importance  of 
the  keywords  in  relation  to  his  question.  The  result 
may  be  the  retrieval  of  some  documents  which  seem  to 
relate  to  the  given  vector  but  which  are  not,  in  fact, 
relevant  to  the  question. 


. 
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An  alternative  search  procedure  may  be  used 
whereby  the  assigned  weights  are  dependent  only  on 
the  form  of  the  question  Y  and  not  on  the  average 
response  of  the  collection.  In  addition  the  user  may 
be  asked  to  supply  his  estimate  of  the  relevance  of 
each  title  in  the  question.  To  each  title  y (m)  in 
Y  is  attached  a  number  f (m)  to  specify  the  relevance 
of  the  set  of  keywords,  the  title,  to  the  question. 

These  titles  are  processed  with  f (m) ,  and  the  optimum 
weights  will  be  determined  from  the  condition  that  the 
average  response  is  to  approximate  as  closely  as  possible 
the  estimated  relevance  of  Y. 

3.2  The  search  procedure 

The  response  to  a  title  y (m)  is 

r (m)  =  Z  w^  y^  (m)  (3.1) 

i 

and  the  error  from  the  estimated  value  f (m)  is 

Q'  =  Z  w^  (m)  -  f (m)  .  (3.2) 

i 

Suppose  the  question  Y  consists  of  M  titles;  then  the 
mean  square  error  Q  is  given  by 

Q  =  i  Z  [  Z  w±  Yi(m)  -  f(m)]2 
m  i 

=  Z  Z  u. .  w.  w.  -  2  Z  v.  w,  +  ^  Z  f 2 (m)  (3.3) 
i  -i  1  ^  i  m 


' 

' 

’ 

. 


. 
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where 


u .  .  =  u  .  .  =  -  E  y  ■  (m)  y  .  (m) 
li  ii  M  Ji  m 

J  J  m  J 

(3.4) 

and 


Vi  =  S'  E  f  (m)  Yi  (m) 
m 

(3.5) 

u. .  is  the  cross-correlation  measure  between 
13 

, ,  .  th 

the  l 

and  the  j ^  keywords  in  a  given  title,  while  may 
be  considered  to  be  the  average  relevance  of  each 
keyword  in  Y. 

The  response  error  Q  is  a  minimum  when  the 
weights  are  chosen  in  such  a  way  that  for  each  w^ 


the  following  equation  holds: 

lQ  .  =  0 

or 

(3.6) 

E  u  .  .  w  .  =  v  . 

13  3  1 

3 

(3.7) 

This  results  in  an  equation  of  the  form 


U  W  =  V 

(3.8) 

The  set  of  optimum  weights  are  obtained  as 


w  =  u  1  V 

(3.9) 

When  the  components  of  W  are  chosen  to  satisfy  equation 
(3.9)  it  follows  from  equation  (3.3)  that  the  minimum 


. 
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value  of  Q  is 


Q 


m 


h  I  f 2  (m) 

m 


vT  u  1  V 


(3.10) 


As  in  the  previous  chapter  it  proves  convenient  to 
approximate  U  ^  by  a  series.  Thus  let 


U  =  G  +  N 


(3.11) 


where  G  is  a  matrix  of  the  form 


g  .  .  =  u .  . 
^ii  n 


=  n 


13 


(3.12) 


and  N  is  a  matrix  whose  elements  are 


n  .  .  =  u .  . 

13  13 


n .  .  =  0 

li 


(3.13) 


Equation  (3.10)  may  then  be  written  as 


q  =  I  i  f2(m)  -  V_  ( I  +  G  1  N )  1  G  1  V  . 

mu  M  — T  — 

m 


(3.14) 


The  expression  for  Qm  may  be  expanded  into  a  power 
series  with  the  assumption  that 


u.  .  >>  u.  . 

li  13 


(3.15) 


so  that  high  order  terms  may  be  neglected.  The  result¬ 
ing  approximate  expression  for  Qm  is  as  follows: 


. 
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Qm  =  S'  Z  f  2  (m)  -  V  <3  1  V  +  V  G  1  N  G  1  V 
m 


S  *  f2(m>  -  £  (^-  - 

m  l  li  j^i  un  “jj 


u  .  .  v  . 

I  L.)  v. 

.  U  .  .  U  .  .  1 


(3.16) 


and  the  weights  are  approximated  by  the  expression 


v . 

w.  =  - - £ 

l  u .  .  .  , .  u  .  .  u.. 


u  .  .  v  . 
1 


(3.17) 


n 


•  /  •  U  •  •  U.  t  i 

37^1  ii  33 


Finally,  the  response  to  a  general  document  z (n) 
in  the  collection  is 


r(n)  =  E  wi  zi  (n)  =  VT  U,^1  Z_(n) 
i 


=  VT  G~ 1  Z  (n)  -  VT  G  1  N  G  1  Z(n) 


v  .  U  .  .  V  . 

?  -  .1  u3'  u1:0  zi(n)  • 

l  li  jfi  li  jj 


(3.18) 


3.3  The  question  form 

As  indicated  in  the  previous  section,  U  is  a 
square  matrix  and  is  symmetric  with  respect  to  the 
diagonal  elements.  The  search  procedure  then  requires 
the  question  in  some  suitable  form  so  that  U  will  be 
well-behaved . 

Suppose  there  are  k  distinct  keywords  in  Y,  and 
that  they  lead  to  k  linearly  independent  columns  (or 
rows)  in  U.  The  rank  of  U  is  then  k.  From  elementary 


■ 

. 
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matrix  analysis  [11],  there  exists  at  least  one  basic 
solution  to  equation  (3.8)  provided  m  :>,  k.  This  leads 
to  the  following  interesting  result: 

number  of  titles  >.  number  of  distinct  keywords.  (3.19) 

The  restriction  as  imposed  on  the  least  square 
search  may  not  be  serious  to  a  user  however.  When 
there  are  many  keywords  in  the  titles  of  Y ,  a  preliminary 
scan  may  be  made  to  eliminate  unimportant  words.  Alter¬ 
natively,  some  keywords  may  be  re-grouped  into  more 
hypothetical  titles.  In  either  case  a  question  form 
may  be  obtained  to  satisfy  (3.19). 

It  should  be  mentioned  that  the  matrix  C  that 
appeared  in  the  matched  search  was  constructed  from 
all  titles  of  the  data  base  and  not  from  titles  in  the 
question.  It  is  unlikely  that  any  real  system  will  have 
more  keywords  than  titles.  For  example  the  Chemical 
Titles  data  base  described  in  Chapter  5  is  such  that 
250,000  titles  contain  approximately  65,000  different 
words  and  many  of  these  words  would  not  be  used  as 
meaningful  keywords.  Thus  condition  (3.19)  will  be 
satisfied  automatically  in  the  matched  search. 

A  particularly  simple  question  form  results  when 
each  title  in  Y  contains  a  single  keyword  so  that  the 
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f (m)  value  associated  with  it  may  be  considered  as  its 
estimated  weight.  In  this  instance  the  task  of  cons¬ 
tructing  a  question  is  much  simplified.  There  is  the 
additional  advantage  that  any  desirable  value  may  be 
assigned  to  a  particular  keyword  and  not  to  a  collection 
of  them  as  would  be  in  the  normal  case.  Thus  the  user 
has  complete  control  in  weighting  each  keyword.  Subse¬ 
quently  U  becomes  a  diagonal  matrix  with  the  nonzero 
elements  equal  to  the  estimated  weights.  Although  the 
usual  matrix  inversion  is  no  longer  required  here,  it 
may  be  shown  that  under  such  circumstances  the  set  of 
optimum  weights  will  be  identical  to  the  set  of  f (m) 
values.  The  search  procedure  in  essence  reduces  to  a 
non-optimized  one  in  which  the  weights  are  totally 
specified  by  the  user.  In  either  case,  the  question 
is  governed  by  a  single  logic  'OR'  similar  to  the  matched 
search . 

3.4  An  example 

The  least  square  procedure  differs  from  that  of 
the  matched  search  in  that  only  one  pass  is  needed 
through  the  data  ba.nk.  Apart  from  this,  other  details 
follow  closely  the  steps  as  outlined  in  section  2.3  for 
the  matched  search. 

The  same  question  for  the  reduction  of  noise 
from  aircraft,  fans,  and  machines  may  be  formulated 
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as  follows: 


Question  Profile:  f (m) 
OR  MACHINE,  NOISE  3.00 
OR  ENGINE,  NOISE,  CONTROL  5.00 
OR  COMPRESS,  NOISE,  REDUCT  3.00 
OR  FANS,  NOISE  2.00 
OR  NOISE,  CONTROL,  AIRCRAFT  3.00 
OR  COMPRESS,  FANS  2.00 


OR  AIRCRAFT,  COMPRESS,  NOISE,  REDUCT  5.00 

There  are  seven  titles  in  all  with  six  distinct  key¬ 
words  in  accordance  with  condition  (3.19).  The 
estimated  relevances  f  (m)  reflect  the  importance 
placed  on  each  title.  The  value  assigned  to  each 
title  is  distributed  among  the  keywords  in  that 
title.  Thus  a  title  with  a  large  number  of  keywords 
will  normally  be  given  a  high  value  of  relevance 
unless  it  is  intended  to  have  negative  or  zero  weights 
attached  to  some  keywords . 

Results  of  using  the  least  square  search  with 
the  above  question  are  tabulated  below: 
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Table. 

3.1  Result  of  least 

square  search 

Normalized 

relevance 

Keywords  in 
title 

Number  of 
titles  retrieved 

1.00 

ENG IN  ,  NOISE 

4 

0.80 

ENGINE 

7 

0.80 

REDUC  ,  AIRCR,  NOISE  1 

0.60 

AIRCR,  NOISE 

8 

0.40 

AIRCR 

3 

0.40 

FANS,  NOISE 

4 

0 . 40 

REDUC,  NOISE 

7 

34  total 

These  34  titles  are  identical  with  those  obtained  by 


the  matched  search. 
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CHAPTER  4 

TWO  PARAMETER  QUESTION 


4 . 1  The  search  criterion 

As  V7as  discussed  in  the  matched  search  a  question 
may  be  formed  simply  by  direct  coding  of  titles  from 
actual  or  hypothetical  documents.  In  the  case  of 
actual  documents  that  are  known  to  be  relevant,  only 
minor  editing  is  required  to  remove  such  non-relevant 
words  as  the  general  'articles'  and  'prepositions'. 

Thus  little  human  effort  is  involved.  As  for  the  least 
square  search  some  care  is  needed  to  pick  keywords  from 
titles  so  that  a  well-behaved  matrix  is  eventually 
obtained.  A  question  form  which  needs  even  more  human 
involvement  will  be  discussed  here,  together  with  a 
new  search  procedure  to  deal  with  such  a  question. 

The  question  will  be  supposed  to  consist  of  2 
'parameters'  each  of  which  contains  a  set  of  keywords 
represented  by  a  vector.  The  need  for  a  2— parameter 
search  may  arise  in  situations  where  titles  of  highly 
specialized  topics  are  to  be  retrieved.  As  will  be 
seen  below,  a  document  under  examination  must  contain 
keywords  that  match  at  least  one  keyword  in  each  of  the 
two  parameters  before  the  document  can  be  considered 
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further  for  its  relevance.  Normally  the  number  of  hits 
will  be  reduced,  and  it  is  hoped  that  through  the  use 
of  this  procedure  titles  not  directly  relevant  will  be 
kept  at  a  minimum  in  the  output. 

4.2  Optimization  procedure 

Suppose  a  document  z (n)  is  processed  by  the  evalu¬ 
ation  of  two  parameters  given  by 


and 


£  w .  z  .  (n) 
1  l 
l 


£  w  f  ^  ^  z  .  (n) 
j  3  3 


(4.1) 


where  r  ^  and  r  ^  are  the  responses  of  the  document 
to  the  two  sets  of  keywords  in  the  first  and  second 
parameter  respectively.  The  summation  in  i  is  over  all 
keywords  associated  with  the  set  belonging  to  the  first 
parameter,  and  the  summation  in  j  is  over  all  keywords 
associated  with  the  second  parameter. 

Let  the  processor  response  to  the  n^  document 


be 

R  (n)  =  r  ^  (n)  .  r  ^  (n) 


(4.2) 


R  (n)  will  be  nonzero  if,  and  only  if,  both  r^  (n)  and 
(2) 

r  (n)  are  nonzero. 
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The  mean  response  of  the  processor  to  the  collec¬ 
tion  of  N  titles  is 


Rm  =  N  Z  R(n> 
n 


~  Z  r  (1)  (n)  r  (2)  (n) 
n 


Z  Z  c.  .  w.(1)  w!2) 

in  l  3 
l  j  J  J 


(4.3) 


where 


c .  .  =  -  Z  z .  (n)  z  .  (n) 
13  N  1  3 

j  n  J 


(4.4) 


and  the  W's  are  the  weights  associated  with  the  two 
parameters.  It  should  be  noted  that  c^  will  not,  in 
general,  equal  c^  since  the  indices  i  and  j  range 
over  two  different  sets  of  keywords.  The  sets  may,  or 
may  not,  have  some  keywords  in  common. 

Suppose  the  question  is  denoted  by  the  two 
vectors  and  x^2\-  the  system  response  is  then 


R  =  r 
x 


(1) 


(2) 


Z  Z  „.(1)  w(w>  X.'1’  x<2> 

•  •  1  3  1  3 

1  j  J 


(4.5) 


It  will  be  assumed  that  the  retrieved  criterion 

requires  that  R  is  to  be  a  maximum  in  comparison  to 

R  .  The  condition  for  the  response  R  to  be  stationary 
m  x 
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with  respect  to  the  mean  response  R  is  that  and 

L  m  i 

(2) 

Wj  be  chosen  to  satisfy  the  equations 


_JL__ 

awJ1^  Rm 


=  0 


and 


9w  j 


(2) 


R 

(~) 

m 


=  0 


(4.6) 


These  result  in  two  sets  of  equations 


x.(1>  I  W<2>  x<2> 

1  j  3  3 


X  Z  c. .  w 
3 


(2) 


=  0,  for  each  i  (4.7) 


and 

x<2)  Z  w(1>  x.(1) 

3  i  1 


X  Z  c.  .  w. 


(1) 


0 ,  for  each  j  (4.8) 


where  X  denotes  the  stationary  value  of  Rx/Rm- 


Equation  (4.7)  may  be  rearranged  into 
X  Z  c.  .  w(2) 

xu)  _  _j  13  _L, 

l 


Z  w!2)  x<2> 

j  3  3 


(4.9) 


The  denominator . is  independent  of  the  index  j  and  thus 


it  proves  convenient  to  introduce  new  variables  v . 


(2) 


defined  by 


v  . 
D 


(2)  _ 


X  , ,  (2 )  _  ,  (2)  „(2) 

7TTT - 7-^-r*  .  W  .  —  A  W  . 

(2)  (2)  j  J 


l  w .  '  x 

j  3  3 


(4.10) 
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Similarly,  from  equation  (4.8)  the  variables  v|^ 
may  be  defined  by 


V(1)  =  x  . 

1  Z  W.(1)  X.(1) 

w(l)  _  1  (1)  w(l) 

W^  —  A  w^ 

(4.11) 

i  1  1 

The 

set  of  equations  then 

becomes 

I  c.  v<2)  =  X.'1’ 

1]  1  1 

3 

for  each  i 

(4.12) 

and 

E  c.  v.(1>  =  x‘2> 
i  ^  1  3 

for  each  j 

(4.13) 

Thus 

in  matrix  form. 

c  v(2)  =  x(1) 

(4.14) 

CTV(1)  =  x(2) 

(4.15) 

As  indicated  in  equations  (4.10)  and  (4.11),  the 
V's  are  directly  related  to  the  W's,  and  apart  from  a 
constant  of  proportionality  the  two  equations  have 
identical  form.  Subsequently  the  V's  may  be  considered 
as  replacing  the  W's  as  the  set  of  optimum  weights 
determined  by  the  optimization  process. 

The  optimum  weights  are  found  from  the  equations 


-  cT  1  X^  (4.16) 

(2)  _  C_1  x  ^ 


and  V 


(4.17) 
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4.3  Question  formulation 

As  has  been  pointed  out  previously  C  will  not  be 
a  symmetric  matrix,  and  for  the  existence  of  C  \ 
it  is  necessary  that  the  two  parameters  contain  the 
same  number  of  keywords.  This  does  not  seem  to  be  a 
serious  restriction  since  the  parameters  may  be  ex¬ 
tended  to  meet  this  requirement  by  the  addition  of  extra 
keywords.  Alternatively  some  keywords  in  one  parameter 
may  be  included  to  the  other  set.  Thus  some  care  should 
be  taken  in  selecting  keywords  for  use  in  a  question. 

Once  a  group  of  keywords  has  been  chosen  to  repre¬ 
sent  a  question,  it  is  necessary  to  separate  them  into 
two  sets  to  fit  the  search  criterion.  Normally  it  will 
be  desirable  for  keywords  of  a  similar  nature  to  be 
placed  in  the  same  set.  An  example  is  provided  by 
words  that  are  synonyms.  When  these  are  combined  with 
keywords  in  the  other  set  there  result  word  pairs  of 
similar  meaning.  The  use  of  synonyms  in  searching  has 
been  found  to  be  beneficial,  since  many  relevant  items 
will  not  otherwise  be  retrieved. 

Another  fact  that  may  be  taken  into  consideration 
is  the  fact  that  many  words  tend  to  associate  highly 
with  certain  other  words.  If  such  associated  word  pairs 
are  included  in  the  question  it  may  be  advisable  not  to 
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put  them  in  the  same  set  since  words  in  the  same 
parameter  are  not  included  in  the  computation  of  C. 

The  matrix  C  will  not  in  general  be  unique  for 
a  given  question.  It  is  clear  that  C_1  may  have 
different  values  depending  on  the  manner  in  which 
the  keywords  are  arranged  in  the  two  parameters. 

In  most  cases  interesting  variations  in  C  may  be 
obtained  by  the  interchange  of  keywords  between  the 
two  sets.  Other  possible  variations  result  from  the 
interchange  of  keyworas  within  the  same  set.  It  should 
be  mentioned  that  given  a  group  of  keywords ,  it  is 
generally  not  known,  a  priori,  how  to  construct  the 
"best"  question  from  these  keywords. 

In  contrast  to  either  the  matched  search  or  the 
least  square  search,  the  question  logic  involves  'AND' 
and  OR  .  Thus  a  question  for  the  2-parameter  search 
has  the  following  form: 


Parameter 

1  : 

A 

OR 

B 

OR 

C 

Parameter 

2  : 

D 

OR 

E 

OR 

F 

4 . 4  An  example 

The  two  parameter  search  requires  two  passes 
through  the  data  base,  the  first  time  for  the  collec¬ 
tion  of  statistical  count  of  the  keywords  in  the  question, 


t j  rl j  i< i'  ff  ^3*  .-*®!j| 


. 


. 


44 


and  the  second  time  for  matching  keywords  in  titles 
with  those  in  the  question.  Except  for  some  minor 
changes  in  detail,  the  procedural  steps  as  outlined 
in  the  flow-chart  for  the  matched  search  may  be  applied 
here . 

A  possible  query  consisting  of  six  keywords  may 
be  constructed  from  the  original  set  of  titles  for 
the  search  of  noise  from  machine,  aircraft  and  fans. 

The  keywords  are  arranged  in  the  two  parameters  as 
follows  : 

Parameter  1  :  AIRCR  ENGIN  REDUC 
Parameter  2  :  ACOUS  NOISE  SOUND 

Results  are  shown  in  Table  4.1. 

T able  4.1  Results  of  2-parameter  search 

Normalized  Keywords  in  Number  of 

relevance  title  documents  retrieved 


1.000 

SOUND , 

ENGIN  , 

NOISE 

1 

.  650 

SOUND, 

ENGIN 

3 

.  350 

- ENGIN 

,  NOISE 

3 

.180 

SOUND, 

AIRCR 

1 

.105 

AIRCR, 

NOISE 

7 

.  100 

ACOUS, 

ENGIN 

3 

.075 

ACOUS, 

AIRCR, 

NOISE 

1 

.070 

NOISE, 

REDUC 

8 

.050 

REDUC , 

ACOUS, 

NOISE 

1 

.050 

REDUC, 

AIRCR, 

NOISE 

1 

29  total 


I 


The  five  titles  that  were  missed  were  the  following 


(1)  Noise  of  centrifugal  fans. 

(2)  Measurement  of  noise  from  ducted  fans. 

(3)  Real  noise  originated  from  power  speed  ducted 
centrifugal  fans. 

(4)  Hearing  of  naval  aircraft  maintenance  personnel 

(5)  Further  experiments  judging  noisy  aircraft 
flight. 
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CHAPTER  5 

COMPARISON  AND  CONCLUSION 

5.1  Comparative  study  of  search  procedures 

The  previous  chapters  were  devoted  to  the  analysis 
of  different  optimum  criteria  for  document  search,  with 
results  of  an  example  given  for  each  method.  This 
example  was  more  for  the  purpose  of  illustration  than 
for  comparative  studies.  With  more  search  questions 
it  will  be  possible  to  obtain  evidence  of  advantages 
and  drawbacks  of  each  criterion  by  analyzing  a  variety 
of  results.  It  may  also  be  possible  to  reach  a  conclu¬ 
sion  to  indicate  the  preferable  choice  among  the  four 
procedures . 

In  the  following  discussion  the  exact  solution 
of  the  matched  criterion,  instead  of  the  approximate 
version,  will  be  used  for  analysis  and  comparison  with 
other  criteria.  This  is  because  the  examples  in  the 
present  investigation  involve  a  small  number  of  keywords 
and  the  former  version  is  then  capable  of  handling  such 
questions  efficiently  in  terms  of  computer  time  and 
storage.  As  has  been  mentioned  previously,  whenever 
a  question  requires  lengthy  computing  time  the  appro¬ 
ximate  method  proves  to  be  more  economical  and  convenient 


to  use. 
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All  desirable  output  responses  are  trimmed  and 
tabulated  according  to  the  following  scheme.  The. 
titles  are  sorted  in  descending  order  of  their  total 
relevances.  These  values  are  normalized  as  fractions 
in  relation  to  the  largest  value  which  appears  at  the 
top  of  the  list  and  which  becomes  equal  to  1 .  To 
indicate  the  nature  of  each  title  retrieved ,  an  entry 
is  provided  to  list  the  keywords  in  it  and  contained 
in  the  question.  In  the  following  questions,  titles 
were  taken  from  the  Cumulative  Index  of  the  Journal  of 
the  Acoustical  Society  of  America . 

Question  A 

In  an  attempt  to  search  for  articles  that  relate 
to  the  design  of  auditoria,  or  music  halls  and  rooms, 
or  studios,  and  with  emphasis  placed  on  sound  absorption 
and  reverberation,  ten  relevant  titles  were  selected  for 
use  as  a  question  for  the  matched  search.  These  titles 
are  listed  below.  They  contain  a  total  of  26  distinct 
keywords  as  underlined,  and  resulting  hits  are  summarized 


in  Table  5.1. 
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(Al) 

(A2) 

(A3) 

(A4) 

(A5) 

(A6) 

(A7) 

(A8) 

(A9) 

(A10) 


The  titles  are  the  following: 

Sound  amplification  in  reverberant  spaces . 

Musi cal- acoustical  criteria  of  concert  halls. 

Models  as  an  aid  in  the  acoustic  design  of 
auditoria . 

The  acoustic  design  of  studio  and  listening 
rooms . 

Acoustics  for  music  performance . 

Formation  and  testing  of  acoustic  conditions  in 
rooms . 

Acoustic  considerations  in  the  design  of  recording 
studios . 

The  design  of  small  studios  and  listening  rooms . 

The  effect  of  absorbent  walls  of  a  closed  space 
on  the  uniformity  of  the  sound  field . 

The  acoustic  conditions  accepted  by  listeners  in 
an  auditorium. 


/ 
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Table  5.1  Matched  Search  A 


Normalized 

relevance 


Keywords  in 
title 


Number  of 
titles  retrieved 


1.000 

ACOUS , 

PERFO , 

CRITE, 

MUSIC 

CONCE, 

HALLS, 

1 

0.994 

DESIG, 

CONCE, 

HALLS 

1 

0.921 

ACOUS , 

STUDI, 

ROOMS , 

MUSIC 

1 

0.710 

MUSIC, 

ROOMS 

1 

0 . 700 

SOUND, 

CLOSE, 

ROOMS 

1 

0.673 

ACOUS, 

HALLS 

1 

0 . 670 

REVER , 

ROOMS , 

HALLS 

1 

0 . 660 

ABSOR, 

HALLS 

1 

0 . 639 

REVER , 

HALLS 

1 

0.616 

CRITE, 

ACOUS, 

ROOMS 

1 

0.536 

ACOUS, 

ROOMS 

1 

0.533 

ACOUS, 

FIELD, 

ROOMS 

2 

0.526 

EFFEC , 

DESIG, 

CONS  I 

1 

0.521 

SOUND, 

ABSOR, 

ROOMS 

1 

0.519 

ROOMS 

6 

0.502 

REVER, 

ROOMS 

4 

0.425 

STUDI, 

ACOUS, 

CONS  I 

1 

0.405 

MODEL, 

ACOUS, 

DESIG, 

AUDIT 

1 

0.384 

ACOUS, 

DESIG, 

PERFO 

1 

0 .381 

ACOUS, 

DESIG, 

AUDIT 

3 

0 . 360 

AUDIT, 

DESIG 

1 

Cut-off  at  0.360 


32  total 
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For  the  cross-correlated  matched  search,  out  of 
the  10  original  titles  7  were  selected  (A2 ,  A3,  A4 ,  A5 , 
A7 ,  A8  and  A9 )  containing  9  distinct  keywords  to  form 
the  question.  They  were:  ABSOR,  AUDIT,  CONCE,  DESIG, 
HALLS,  MUSIC,  REVER,  ROOMS,  STUDI.  Results  were: 

Table  5.2  Cross-correlated  matched  search  A 


Normalized  Keywords 

relevance  (N.R.)  title 


1.000 

CONCE, 

HALLS 

0.976 

DESIG, 

CONCE 

0 .908 

STUDI, 

ROOMS 

0.718 

ABSOR, 

AUDIT 

0.595 

CONCE, 

AUDIT 

0.554 

ABSOR, 

CONCE 

0.495 

ABSOR, 

REVER 

0.477 

DESIG, 

AUDIT 

0 . 445 

ABSOR, 

HALLS 

0.431 

ABSOR, 

ROOMS 

0 . 313 

DESIG, 

CONCE 

0.274 

.  STUDI, 

REVER 

0.274 

MUSIC, 

ROOMS 

0.269 

REVER, 

HALLS 

0.255 

REVER, 

ROOMS 

0 .034 

ROOMS 

in  Number  of 

documents  retrieved 

MUSIC  1 

HALLS  1 

MUSIC  1 

1 

1 

1 

5 

5 
1 
1 
1 
1 
1 
1 

6 

10 

38  total 


Cut-off  at  0.034 
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In  the  least  square  method  9  titles  (all  except 
A9)  and  8  keywords  were  used,  and  the  question  profile 
was  chosen  as  follows: 


f  (m) 

REVER, 

AUDIT 

4.0 

MUSIC, 

HALLS 

o 

• 

DESIG, 

AUDIT 

4.0 

DESIG, 

STUD I,  ROOMS 

6.0 

MUSIC 

4.0 

DESIG, 

STUD  I 

4.0 

STUD I, 

ROOMS 

4.0 

CONCE, 

HALLS 

3.0 

AUDIT 

o 

• 

i — 1 

Table  5.3  Least  Square  Search  A 


Normalized 

relevance 

Keywords 

title 

in 

Number  of 
titles  retreieved 

1.000 

STUDI, 

ROOMS, 

MUSIC 

1 

0.902 

CONCE, 

HALLS , 

MUSIC 

1 

0 . 806 

MUSIC, 

ROOMS 

1 

0.710 

DESIG, 

CONCE, 

HALLS 

1 

0.678 

•  DESIG, 

REVER 

1 

0.645 

REVER , 

ROOMS 

4 

0.613 

REVER , 

HALLS 

1 

0.550 

REVER, 

STUDI 

1 

0 .484 

DESIG, 

AUDIT 

5 

0.452 

DESIG, 

CONCE 

2 

0.355 

REVER 

2 

20  total 

Cut-off  at 

0 . 355 

' 
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The  question  for  the  2-parameter  search  was 
arranged  into  the  form 

PI  :  AUDIT,  ROOMS,  HALLS,  STUDI 
P2  :  CONCE,  MUSIC,  PERFO ,  REVER . 

A  total  of  five  titles  were  produced  as  hits; 
they  are  listed  in  table  5.4. 

Table  5.4  2-parameter  search  A 

Normalized  Keywords  in  Number  of 

relevance  title  titles  retrieved 


1.000 

PERFO, 

STUDI 

1 

0 .666 

CONCE, 

HALLS, 

PERFO,  MUSIC  1 

0.333 

STUDI, 

REVER 

1 

0 . 333 

REVER, 

HALLS 

1 

0.333 

STUDI, 

MUSIC, 

ROOMS  1 

5  total 


Cut-off  at  0.333 
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Question  B 

This  request  is  concerned  with  the  horn  type 
loudspeakers  of  a  stereo  playback  system.  The  follow¬ 
ing  ten  titles  form  the  basis  of  the  question. 

(Bl)  Evaluation  of  a  modulated  air-flow  loudspeaker. 

(B2)  Evaluation  of  a  stereophonic  loudspeaker  by 
multiple  microphone  arrays . 

(B3)  Method  of  improving  acoustic  transmission  in 
folded  horns . 

(B4)  Generation  of  intense  audio  sound  fields  utili¬ 
zing  arrays  of  multiple  driver  horns . 

(B5)  New  stereophonic  projection  console . 

(B6)  Theory  of  isophonic  speaker  systems . 

(B7)  Experimental  full  range  electrostatic  speaker . 

(B8)  Crossover  problems  in  two  way  loudspeaker  systems . 
(B9)  Infinite  baffle  comprising  a  spherical  shell  of 
foamed  plastic. 

(BIO)  Improved  directional  distribution  from  horn  arrays. 

The  underlined  keywords  were  used  for  the  matched 
search,  while  the  cross  correlated  version  retained 
only  nine  keywords  in  the  same  number  of  titles,  namely, 
AUDIO,  BAFFL ,  HORNS,  HORN,  LOUDS,  MODUL ,  SPEAK,  STERE, 
SYSTE.  Results  are  as  shown  in  Tables  5.5  and  5.6 
respectively . 
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Table  5. 

5  Matched  Search  B 

Normalized 

relevance 

Keywords  in 
title 

Number  < 
titles  retr 

1.000 

HORN , 

SYSTE ,  HORNS 

1 

0 .920 

STERE , 

LOUDS 

1 

0.728 

AUDIO, 

SOUND,  FIELD,  MULTI 

f 

DRIVE, 

HORNS 

1 

0 . 564 

SPEAK, 

SYSTE 

1 

0.558 

ELECT, 

SPEAK 

1 

0.558 

SPEAK 

3 

0.506 

HORN 

4 

0.484 

HORNS 

6 

0 .484 

ACOUS, 

TRANS,  HORNS 

1 

0.476 

HORNS , 

TRANS 

1 

0.472 

ACOUS, 

HORNS,  SOUND 

1 

0.470 

LOUDS, 

MODUL 

3 

0.461 

LOUDS, 

SYSTE 

1 

0.460 

NOISE, 

TWO,  LOUDS,  SYSTE 

1 

0.460 

ELECT , 

LOUDS 

1 

0.449 

LOUDS, 

NOISE 

1 

0 .445 

TWO,  LOUDS 

3 

31  total 


Cut-off  at  0.445 
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Table  5.6  Cross-correlated  search  B 


Normalized 

relevance 

Keywords  in 
title 

Number  of 
titles  retrieved 

1.000 

HORN,  SYSTE,  HORNS 

1 

0 . 366 

LOUDS,  SYSTE 

2 

0 . 342 

STERE,  SYSTE 

1 

0 . 326 

SPEAK,  SYSTE 

1 

0.134 

LOUDS,  MODUL 

3 

0.120 

STERE,  LOUDS 

1 

0.116 

MODUL,  AUDIO 

3 

0.090 

AUDIO,  HORNS 

1 

0.031 

HORN 

4 

0.021 

STERE 

5 

0.020 

LOUDS 

13 

0 . 016 

HORNS 

9 

0.013 

SPEAK 

4 

48  total 


Cut-off  at  0.013 

Titles  Bl ,  B2,  B3,  B4  ,  B5,  B6 ,  B9  and  BIO  contain¬ 
ing  a  total  of  eight  distinct  keywords  were  chosen  for 
the  least  square  search.  These  keywords  together  with 
the  corresponding  relevance  estimates  are  as  follows: 
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LOUDS 

3.0 

STERE , 

LOUDS 

5.0 

HORNS 

o 

• 

CM 

DRIVE , 

HORNS 

3.0 

STERE, 

CONSO 

3.0 

SPEAK 

2.0 

BAFFL 

o 

• 

CM 

HORN 

O 

• 

CM 

This  results  in 

Table 

42  titles  as  in  the  following 

5.7  Least  square  search  B 

table : 

Normalized 

Keywords  in 

Number  of 

relevance 

title 

titles 

retrieved 

1.000 

STERE,  LOUDS 

1 

0.800 

HORN,  HORNS 

1 

0.600 

DRIVE,  HORNS 

1 

0 . 600 

LOUDS 

18 

0.600 

DRIVE,  BAFFL 

2 

0.400 

HORNS 

9 

0.40 

HORN 

3 

0.40 

SPEAK 

4 

0.40 

BAFFL 

3 

42  total 


Cut-off  at  0.40 


' 
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Another  set  of  8  keywords  was  selected  for  the 
2-P  search.  They  were  specified  as 


PI  :  LOUDS,  MULTI ,  INF IN,  DRIVE 
P2  :  SYSTE,  MODUL,  HORNS,  STERE 


Table  5 . 8 

summarizes  the 

results . 

Table  5.8  2- 

-P  Search  B 

Normalized 

relevance 

Keywords  in 
title 

Number  of 
titles  retrieved 

1.000 

STERE, 

LOUDS,  MULTI 

1 

1.000 

MULTI, 

DRIVE,  HORNS 

1 

0.600 

MULTI, 

SYSTE 

3 

0 . 533 

INF IN, 

SYSTE 

2 

0.60 

LOUDS, 

SYSTE 

2 

0.20 

LOUDS, 

MODUL 

3 

0.15 

DRIVE, 

MODUL 

1 

13  total 


Cut-off  at  0.15 

Question  C 

It  will  be  supposed  that  the  required  documents 
relate  to  the  effect  of  ultrasonic  waves  in  light 
beams,  with  attention  focused  on  the  amplitude,  reflec¬ 
tion  or  diffraction  variation  of  the  latter.  Ten  titles 
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were  selected  as  the  basic  reference  with  a  total  of 

twenty  basic  keywords  used  in  the  matched  search. 

They  are  as  follows: 

(Cl)  Diffraction  of  light  by  two  ultrasonic  waves  of 
the  same  frequency . 

(02)  Diffraction  of  electromagnetic  waves  by  sound  waves. 

(03)  Optical  effects  of  ultrasonic  waves  producing  phase 
and  amplitude  modulation . 

(C 4 )  Determination  of  finite  amplitude  distortion  by 
light  diffraction . 

(05)  Effect  of  ultrasonic  waves  on  concentration  gradients . 

(06)  Optical  methods  for  the  measurement  of  sound  pressure 
in  liquids. 

(C7)  Determination  of  finite  amplitude  distortion  by 
light  diffraction . 

(08)  Electromagnetic  reflection  from  sound  waves. 

(09)  Diffraction  of  light  by  ultrasonic  waves  of  various 
standing  wave  ratios . 

(CIO)  Diffraction  of  wide  and  narrow  light  beams  by 

distorted  finite  amplitude  progressive  ultrasonic 
waves  in  water. 


Results  of  this  search  appear  in  Table  5.9. 
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Table  5 . 9  Matched  Search  C 


Normalized 

relevance 


Keywords  in 
title 


Number  of 
titles  retrieved 


1.000 

DIFFR , 

LIGHT, 

ULTRA, 

RATIO 

1 

0.905 

DIFFR , 

LIGHT, 

BEAMS , 

DISTO, 

FIN IT, 

AMPLI, 

ULTRA, 

WATER , 

DIFFR 

1 

0 . 798 

LIGHT, 

ULTRA, 

BEAMS , 

FREQU 

1 

0 .784 

FINIT, 

AMPLI, 

DISTO, 

LIGHT, 

DIFFR 

1 

0 .780 

ULTRA, 

LIGHT, 

DIFFR 

1 

0.756 

DIFFR, 

LIGHT, 

AMPLI, 

ULTRA 

2 

0 . 750 

DIFFR, 

LIGHT, 

ULTRA, 

DISTO, 

LIQUI 

1 

0.744 

LIGHT, 

DIFFR, 

DISTO, 

ULTRA 

1 

0.723 

ULTRA, 

DIFFR, 

LIGHT 

2 

0.720 

ULTRA, 

DIFFR, 

LIGHT, 

FINIT, 

AMPLI 

2 

0 . 695 

EFFEC , 

LIGHT, 

DIFFR, 

ULTRA 

2 

0 .692 

DIFFR, 

LIGHT 

2 

0 .688 

DIFFR, 

LIGHT, 

ULTRA 

8 

25  total 


Cut-off  at  0.688 
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With  the  cross-correlated  search,  the  following 
9  keywords  were  used:  AMPLI,  BEAMS,  DIFFR ,  ELECT,  LIGHT, 
MAGNE,  OPTIC,  REFLEC,  ULTRA.  Results  are  presented  as 
follows : 

Table  5.10  Cross-correlated  search  C 


Normalized 

relevance 


Keywords  in 
title 


Number  of 
titles  retrieved 


1.000 

DIFFR, 

LIGHT, 

BEAMS , 

AMPLI,  ULTRA  1 

.675 

LIGHT, 

DIFFR, 

AMPLI, 

ULTRA  4 

.633 

DIFFR, 

LIGHT, 

ULTRA, 

BEAMS  1 

.  362 

AMPLI, 

LIGHT, 

DIFFR 

1 

.  355 

LIGHT, 

DIFFR, 

ULTRA 

17 

.  333 

DIFFR, 

ULTRA 

1 

.  320 

ULTRA, 

BEAMS , 

DIFFR 

1 

.302 

AMPLI, 

ULTRA, 

OPTIC 

4 

.290 

OPTIC, 

MAGNE, 

ULTRA 

1 

.286 

OPTIC, 

ULTRA 

1 

.286 

ULTRA, 

BEAMS 

1 

33  total 


Cut-off  at  .286 


The  question  for  the  least  square  search  was  chosen 
as  follows :- 

DIFFR,  LIGHT  3.0 


DIFFR,  SLECT 


3.0 
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OPTIC, 

ULTRA , 

AMPLI 

5.0 

AMPLI, 

LIGHT, 

DIFFR 

5.0 

ULTRA 

1.0 

OPTIC 

2.0 

ELECT , 

MAGNE 

3.0 

DIFFR, 

ULTRA 

3.0 

DIFFR, 

ULTRA 

LIGHT, 

BEAMS , 

6.0 

It  is  made  up  of  9  titles  and  8  keywords. 


Table  5.11  Least  square  search  C 

Normalized  Keywords  in  Number  of 

relevance  title  titles  retrieved 


1.000 

DIFFR, 

LIGHT, 

BEAMS , 

AMPLI,  ULTRA  1 

.75 

ULTRA, 

LIGHT, 

DIFFR, 

BEAMS  1 

.75 

DIFFR, 

LIGHT, 

AMPLI, 

ULTRA  4 

.625 

OPTIC, 

MAGNE, 

ULTRA 

1 

.  625 

ULTRA, 

BEAMS, 

DIFFR 

1 

.625 

OPTIC, 

AMPLI, 

ULTRA 

1 

.625 

AMPLI, 

LIGHT, 

DIFFR 

1 

.625 

LIGHT, 

DIFFR, 

ULTRA 

17 

31  total 

Cut-off  at  .625 
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The  2-P  search  used  the  following  question: 
PI  :  LIGHT,  OPTIC,  ULTRA,  ELECT 
P2  :  BEAMS,  AMPLI,  DIFFR,  PRESS 

Table  .5.12  2-P  search  C 


Normalized  Keywords  in  Number  of 

relevance  title  titles  retrieved 


1.000 

ULTRA, 

BEAMS 

1 

0.864 

ULTRA, 

BEAMS , 

DIFFR 

1 

0 . 730 

OPTIC, 

DIFFR 

3 

0.574 

LIGHT, 

BEAMS 

1 

0.476 

OPTIC, 

PRESS 

1 

0.394 

DIFFR, 

LIGHT, 

BEAMS , 

AMPLI, 

ULTRA 

1 

0 . 368 

DIFFR, 

ULTRA, 

BEAMS 

1 

9  total 

Question  D 

The  query  is  made  for  recording  techniques  with 
magnetic  tapes  or  disks.  With  a  set  of  the  following 
8  titles,  the  matched  search  produced  20  relevant 
documents  as  in  Table  5.13. 

(Dl)  Stereophonic  and  quasi-stereophonic  reproduction. 

(D2)  Improved  quasi-stereophonic  and  colorless  ai ti f icial 
reverberation . 
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(D3)  Analysis  of  a  piezoelectric  phonograph  transducer 
as  a  function  of  the  clamping  position. 

(D4)  Surface  and  groove  noise  in  disk  recording  media . 

(D5)  Determination  of  the  recording  performance  of  a 
tape  from  its  magnetic  properties. 

(D6)  Experimental  and  theoretical  investigation  of  the 
magnetic  properties  of  iron  oxide  recording  tape . 

(D7)  Method  and  apparatus  for  dual  sound  track  recording . 

(D 8 )  Method  and  apparatus  for  magnetic  recording  and 
reproducing . 


Table  5.13  Matched  search  D 


Normalized 

relevance 

Keywords  in 
title 

1.000 

RECOR,  TAPE ,  IRON,  OXIDE 

0.699 

SURFA,  GROOV ,  NOISE,  DISK 

RECOR,  MUSIC 

0.608 

MAGNE,  RECOR,  TAPE 

0.594 

RECOR,  PERFO ,  TAPE 

0 . 580 

TAPE,  RECOR 

0.574 

SOUND,  TAPE,  RECOR 

0 .482 

PIEZO,  PHONO,  TRANS 

0.465 

PHONO 

0.448 

STERE,  SOUND,  REPRO 

0.410 

MAGNE,  TAPE 

0.377 

TRACK 

Cut-off  at  0.377 


Number  of 
titles  retrieved 


1 

2 

1 

1 

1 

1 

1 

3 

3 
1 

4 


20  total 


■ 
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With  the  cross-correlated  search,  the  following 
8  keywords  from  the  8  titles  were  used: 

DUAL,  GROOV ,  PHONO,  RECOR,  REPRO,  STERE, 

TAPE,  TRACK. 

Table  5.14  Cross-correlated  search  D 


Normalized 

relevance 

Keywords  in 
title 

Number  of 
titles  retrieved 

1.000 

RECOR,  TAPE 

5 

0 . 902 

GROOV,  RECOR 

2 

0.279 

TAPE 

1 

0.257 

STERE,  REPRO 

3 

0.222 

GROOV 

1 

0.0159 

REPRO 

5 

0.0102 

STERE 

11 

0.0047 

RECOR 

5 

33  total 


The  search  question  for  the  least  square  method 
was  as  follows : 


STERE, 

REPRO 

2.0 

STERE , 

DISK 

2.0 

PHONO , 

DISK 

2.0 

GROOV, 

DISK,  RECOR 

3.0 

RECOR, 

TAPE 

2.0 

RECOR 

• 

O 

, 
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RECOR,  REPRO  2.0 

STERE,  TAPE  2.0 

These  contain  7  keywords  in  the  8  titles, 

of  which  are  in  Table  5.15. 

Table  5.15  Least  square  search 

the  results 

»> 

D 

Normalized 

Keywords  in 

Number  of 

relevance 

title 

titles  retrieved 

1.000 

GROOV,  DISK,  RECOR 

2 

0.667 

RECOR,  TAPE 

1 

0.667 

STERE,  REPRO 

3 

0 .667 

RECOR,  TAPE 

4 

0.667 

DISC,  RECOR 

1 

0.333  . 

PHONO 

4 

0.333 

GROOV 

1 

0.333 

STERE 

1 

0.333 

RECOR 

3 

20  total 

As  for 

the  2-P  search,  the 

2  vectors 

were  as 

follows  : 

PI  :  DISK,  GROOV ,  RECOR,  NOISE 


P2  :  SURFA,  TAPE, 


MAGNE,  RECOR 


i 

.. 
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Table  5.16  2-P  search  D 


Normalized  Keywords  in  Number  of 

relevance  title  titles  retrieved 


1.000 

SURFA, 

GROOV , 

NOISE,  DISK,  RECOR 

2 

0 . 464 

NOISE, 

RECOR, 

TAPE 

1 

0.400 

.MAGNE, 

RECOR 

1 

0.392 

MAGNE, 

NOISE 

1 

0.328 

RECOR, 

TAPE 

4 

0.072 

RECOR 

6 

15  total 

5.2  Discussion  and  conclusion 

Although  only  4  examples  were  analyzed  in  the 
previous  section,  a  general  pattern  may  be  recognized 
from  the  results  in  that  the  cross-correlated  version 
produces  the  most  relevant  items,  followed  by  the  least 
square  and  the  matched  methods,  while  the  2-parameter 
produces  the  least  number. 

In  general  the  matched  search  provides  an  overall 
unsophisticated  approach  to  document  searching  since, 
with  the  minimum  effort  on  the  part  of  the  questioner, 
useful  responses  may  be  obtained.  In  this  respect,  it 
may  be  regarded  as  a  tool  for  general  purpose  searches 
which  do  not  require  very  critical  results. 
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Although  many  drawbacks,  such  as  the  mathematical 
complexity,  large  computer  storage  and  longer  computing 
time  involved,  have  been  observed  in  the  more  refined 
matched  search  with  cross-correlated  terms,  responses 
obtained  were  very  favourable  and,  in  spite  of  some 
very  crude  estimates  of  numerical  values,  results  were 
generally  better  than  without  the  use  of  the  extra  terms 
in  regard  to  Recall  and  Precision.  It  must  be  mentioned, 
however,  that  the  choice  of  keywords  to  make  up  the 
question  was  partly  based  on  the  questioner's  familiarity 
with  the  literature  in  the  data  base.  Consequently  the 
success  of  the  search  was,  to  some  extent,  affected  by 
it.  The  same  success  may  or  may  not  be  obtained  by  another 
user  with  little  or  no  experience  in  question  formulations 
or  with  little  familiarity  with  the  base  keywords  in  the 
collection . 

As  can  be  seen  from  the  results,  the  2-parameter 
criterion  may  be  considered  a  rather  specialized  method. 
Results  were,  in  some  cases,  too  restricted  for  it  to  be 
regarded  useful  as  a  general  purpose  search  procedure. 

The  following  example  will  illustrate  this  fact.  One 

of  the  methods  currently  being  used  by  the  Alberta 
Information  Retrieval  Association  is  a  modified  version 
of  the  2-parameter  procedure  where  the  question  may  be 
specified  by  many,  instead  of  only  two,  parameters.  The 
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logics  used  may  be  1 AND 1  ,  'OR',  'NOT',  or  'NOR'.  It 

also  differs  from  the  2-parameter  approach  in  that 
the  weights  are  not  optimized  as  was  done  in  Chapter 
four . 

A  user  submits  his  question  for  searching  on  the 
fortnightly  Chemical  Titles  tapes  over  a  period  of 
time.  Results  on  each  search  are  returned  to  the 
user,  and  he  is  asked  to  indicate  the  titles  that  he 
regards  as  relevant.  A  data  base  based  on  a  number  of 
Chemical  Titles  issues  is  constructed ,  and  weights  are 
determined  by  word  counts  made  on  those  words  that 
appeared  in  the  relevant  titles.  Details  on  Chemical 
Titles  search  may  be  found  elsewhere  [12]  ,  the  follow¬ 
ing  question  may  serve  as  an  example: 

Profile  :  Peptide  Synthesis 

PI  :  PETIDE,  PROTEIN 

P2  :  SYNTHE,  DERIV ,  PREPARA 

From  the  accumulated  lists  of  27  relevant  titles,  the 
following  words  were  found  to  be  most  satisfactory  in 
obtaining  results  relevant  to  the  original  question. 


Normalized  weight 

Word 

1.000 

DES 

.523 

PEPTIDE 

.  500 

PROLYL 

.  455 


LYSYL 
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Normalized  weight 


Word 


385 


ORNITHINE 


261 


GLYCYL 


156 


THREON 


135 


SERINE 


134 


TERMIN 


132 


LYSINE 


064 


SYNTHE 


045 


PROTEIN 


While  PEPTDE  is  considered  an  important  word, 
the  original  words  DERIV  and  PREPARA  are  not  in  this 
list.  This  indicates  that  much  better  results  may  be 
obtained  when  the  above  list  of  words  are  used  as  a 
question  for  the  matched  search. 

As  shown  in  the  four  examples  the  least  square 
procedure  can  provide  very  satisfactory  results . 

Success  also  depends,  to  some  extent,  on  the  experience 
of  the  user  for,  by  manipulating  the  relevance 

estimates  f  (m)  ,  emphasis  may  be  placed  on  some  preferred 
keywords.  In  most  cases,  this  results  in  marginally 
improved  responses .  It  may  then  be  speculated  that  an 
experienced  user  would  prefer  to  use  this  method. 

As  has  been  observed  in  section  3.3,  a  degenerate 
form  of  the  least  square  method  exists.  In  this  case, 
a  user  may  specify  an  estimated  value  of  relevance  for 
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each  keyword  separately.  This  results  in  a  diagonal 
matrix,  and  the  computed  weights  will  be  equal  to  the 
originally  specified  values  of  relevance.  Since  the 
weights  are  not  optimized  the  response  will  not  be 
optimum.  In  spite  of  this,  a  user  may  sometimes  find 
it  useful  to  use  this  form  of  search  in  order  to  place 
emphasis  on  certain  keywords. 

A  factor  that  must  concern  the  user  is  cost. 

It  is  hoped  that  the  optimum  procedures  discussed  here 
will  lead  to  solution  of  some  of  the  problems  in  getting 
the  best  response  efficiently  and  economically.  Questions 
involving  computing  time  and  machine  storage,  as  well  as 
the  question  form,  are  responsible  for  the  total  cost. 

In  regard  to  this,  some  investigation  have  been  done 
in  relation  to  Chemical  Titles  searching  using  the 
matched  criterion  [ 12  J  -  At  present  no  such  cost  functions 
are  available  for  either  the  2-parameter  or  the  least 
square  procedures.  Until  the  time  that  these  are  esta¬ 
blished,  the  least  square  criterion  seems  to  offer  a 
more  satisfactory  procedure  for  document  search. 


I 


' 


71 


REFERENCES 


(1)  Borke,  H. ,  Burnaugh ,  H.P.  "Interactive  Displays 

for  Document  Retrieval" ,  Information  Display, 

Vol.  3,  pp.  47-90,  1966. 

(2)  Drew,  D.L.,  Summit,  R.K.,  Tanaka,  R.I.,  Whiteley,  R.B. 

"An  On-Line  Technical  Library  Reference  Retrie¬ 
val  System",  American  Documentation,  Vol.  17, 
pp .  3-7 ,  1966 . 

(3)  Salton,  G.  "A  Document  Retrieval  System  for  Man- 

Machine  Interaction" ,  Association  for  Computing 
Machinery ,  Proceedings  of  the  19  th  Annual 
Conference,  Philadelphia,  1964. 

(4)  Salton,  G.  "The  SMART  System  -  Retrieval  Results  and 

Future  Plans",  Scientific  Report  No.  ISR  -  11 
to  the  National  Science  Foundation,  Vol.  1, 
Section  1,  Cornell  University,  Dept,  of  Computing 
Sc .  ,  19  6  6. 

(5)  Salton,  G.  Automatic  Information  Organization  and 

Retrieval,  McGraw  Hill,  New  York,  1968. 


. 


' 


72 


(6)  Barkle,  J.K.  "Construction  of  Weighted  Term  Profiles 
by  Measuring  Frequency  and  Specificity  in 
Relevant  Items",  2nd  International  Conference 
on  Mechanized  Information  Storage  and  Retrieval , 
Cranfield,  Bedfordshire,  England,  September 
1969  . 


(7)  Wiener,  N.  Extrapolation,  Interpolation,  and  Smooth¬ 

ing  of  Stationary  Time  Series,  J.  Wiley  &  Sons, 
Inc.,  New  York,  1949. 

(8)  Levinson,  N.  "The  Wiener  RMS  Error  Criterion  in 

Filter  Design  and  Prediction",  Journal  of 
Mathematics  and  Physics,  Vol.  15,  No.  4, 
pp.  261-278,  January  1947. 

(9)  Dwork ,  B.M.  "Detection  of  A  Pulse  Superimposed  on 

Fluctuation  Noise",  Proceedings  of  IRE, 
pp.  771-774,  July  1950. 

(10)  Heaps,  H.S.  "Boolean,  Fractional,  and  Association 

Searches  on  Truncated  Title  Words",  Proceedings 
of  the  American  Society  for  Information  Science, 


Columbus,  Ohio,  October  1968. 


- 


. 


73 


(11)  Perlis,  S.  Theory  of  Matrices,  Addison-Welsely 

Press  Inc.,  Cambridge,  Mass.,  1952. 

(12)  Heaps,  H.S.  and  Thiel,  L.H.  "Optimum  Procedures 

for  Economic  Information  Retrieval" , 
Information  Storage  and  Retrieval,  Vol.  6, 


1970.  In  press. 


■ 


74 


APPENDIX  A 


Program  Listing 

The  following  pages  contain  source  programs  of 
the  matched,  the  cross-correlated,  the  least  square, 
and  the  2-parameter  search. 

In  the  matched  and  the  least  square  search  the 
subroutine  FINVER  is  used  to  compute  the  inverse  of 
a  matrix  using  an  existing  library  subprogram  CS011A. 

A  different  subroutine  FINVA  is  used  in  the  2-parameter 


search . 
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***  MATCHED  SEARCH  *** 


COMMON  WTT{ 200 ) , I WT ( 2  00 ) , C 

REAL  MAT (60, 60) , I N VER S ( 60 , 60 ) ,RHS < 60 ) , BB ( 60 , 60 ) , RR { 60) , XX f 60 ) 
INTEGER  PIVOT (60) ,C(100,100) 


DIMENSION  NOTERM ( 20)  , KEYWDt  20,20) ,NUMI (20)  , KOUNT t 200) , ITITLtlOO)  , 
II  TEN  (  100,50  ).,  STORE  ( 10  0)  ,  KEY  BAR  (200) 

DATA  KB  LAN/  '  *  /  ,  KZ  Z  Z7./ ’  Z  111  *  /  ,  KLBR/ *{  •/,KRBR/M  •/, 

1K.COM/  "  ,  ’/ 


C 

C 

C 

C 


READ  SPECIFIED  KEYWORDS  KEY  WD (I ,  M )  M=1  TO  KAPM 
FIND  N  U  MI  (  M  )  =  2  5>  NUMBER  OF  KEYWORDS  KEYWDt  I,  M) 


21  FORMAT ( 21 4) 

NUM=0 

DO  35  M=1,KAPM 
I  1=1 

24  12=11+25 

READt  5,25) ( KEYWDt I ,M)  , 1  =  1  1,12) 

25  FORMAT ( 13 ( A4, A2 ) ) 

13=12-1 

DO  23  I =1 1,13,2 

I E ( KEYWDt  I , M ) «EQ.  K8LAN)  GOTO  30 
28  CONTINUE 
11=11+26 
GOTO  24 
30  K= I -1 

WRITE  (6,333) 

333  FORMAT (/5X, *THE  FOLLOWING  ARE  KEYWORDS  FROM  A  QUESTION8 ) 
WR I TE  t  6 , 334 )  ( KEY  WD  U  ,  M ) , 1  =  1 1 , K ) 

334  FORMAT  (15X,2A4) 

NOTERM ( M)=K/2 

WRI TE ( 6 , 5  55  )  NOTE RM ( M ) 

555  FORMAT  (5X,  *N0.  OF  TERMS=’,I2) 

N'UHI  (  M)  =K 
NUM=NUM+NUMI (M) 

35  CONTINUE 

I F { NUM . LE . 200 )  GOTO  37 
WRITE (6,36) 

36  FORMAT ( 5X , 1  MORE  THAN  100  KEYWORDS  READ  *) 

STOP 

3^  IF ( (TRACE. NE. 1)  GOTO  40 
WRITE (6, 33)  KAPM 
33  FORMAT ( 5X , 1  KAPM= ’ , 14 ) 

WRITE (6,39)  MUM 
30  FORMAT ( 5X NUM= 14/ ) 


C 

c, 

c 

c 


40  11=1 


i 
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44  DO  50  M  =  1 , K A P M 
I2=I1+NUMI { M ) -1 
DO  48  1=11,12 
KEYBAR { I )=KEYWD( I-I 1+1, M) 

48  CONTINUE 
I  1=12+1 
50  CONTINUE 
NUM= 1 1 - 1 

C  SET  IWT(I)=1  FOR  1=1, NUM- 1  IN  STEPS  OF  2 

J=NUM- 1 
DO  55  1=1, J ,2 
I WT U  }  =  1 
55  CONTINUE 
C. 

C  RUBBLE  SORT  KEYBARCI) 

DO  78  K  = 1 , J , 2 
I=NUM-1 
L  A  ST=  0 

62  I F ( KEY BAR ( T )  -  KEYBAR I  1-2 ) )  66,64,76 
64  I F ( KEYBAR { I +1 ) -KEYBAR { I — 1 ) )  66,70,76 
66  IT=KEYB AR ( I ) 

K  EY  BAR ( I ) =K  E YB AR { 1-2 ) 

KEYBAR { 1-2 ) = I T 
I T= KEYBAR U  +1 } 

KEYBAR I  1+  1  ) =KEY  BARI T- 1 } 

KEYBAR ( I-1)=IT 
IT=IWT{  I  ) 

I WT ( I  }  =  IWT(  1-2) 

I  WT  ( I  -  2  )  =  I T 
L  4ST=  1 
GOTO  76 

70  IWT{  I  —  2  )  =  I W  T-  (  1-2)  +1  WT  (  I  ) 

i  wt  m=o 

KEYBAR  U)-KZZZZ 
KEYBAR (1+1) =KBL  AM 
L  AST= 1 

76  1=1-2 

I  F  {  T.GT.K  )  .GOTO  62 
IF(LAST.FQ.O)  GOTO  80 

78  CONTINUE 

ELIMINATE  DUPLICATE  KEYWORDS 
DO  79  1=1, J, 2 

IF  (KEYBARt IJ.EQ.KZZZZ)  GOTO  77 

79  CONTINUE 
GOTO  SO 

77  NUM=I - 1 

80  IF {  I  TRACE .NF. 1 )  GOTO  100 
WRITE (6, 39)  NUM 
NA=NUM-1 

WRITE (6,90)  ( ( I , KEYBAR ( I ) , KEYBAR Cl +1) , KLBR, IWT ( I ) » KRBR,  KCQM )  , 
A  1  =  1 , NA, 2 ) 

90  FORMAT  ( 5 ( 5X , I  3 , 1 X , 2 A4 , IX , A  1 ,  12 , 2 A1 ) / ) 

NUM I  C  M )  =  I  — 1 
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SET  C(I,J)=0  FOR  I,J=1  TO  NUM 
NN=  NUM / 2 

100  DO  102  I=1,NN 
DO  101  J=1,NN 
C{  I,  J  )  =  0 

101  CONTINUE 

102  CONTINUE 
MAG 4= 3 

SET  KOUNT (I)=0 

109  DO  103  1=1, NUM 
KOUNT ( T )=0 

108  CONTINUE 

READ  ONE  TITHE  FROM  TAPE 
NUM 1= NUM- 1 

110  READ { MAG4 I N , ( I T I T L ( I }  ,  I  =  1 , N ) 

TFIN.EQ.4)  GOTO  150 
COMPARE  ITITL  WITH  KEYBAR 
INCREMENT  KOUNT  (I) 

112  N l=M-l 

DO  120  1=5, Nl, 2 
DO  115  J=1,NU.M1,2 

IFIITITLI  I)  *NE. KEYBAR ( J I )  GOTO  115 
I  FUTITLU+1)  •  NE  .  KEYBAR  <  J  +  l )  }  GOTO  115 
KOUNT ( J  )=  1 
GOTO  120 
115  CONTINUE 
120  CONTINUE 

INCREMENT  C  ( I  ,  J  ) 

DO  130  1=1, NUM 1,2 
M=I /2+1 

DO  12  5  J=  l , NUM 1 , 2 
IF (KOUNT { J ) .NE.  1)  GOTO  125 
L  -  J  /  2  + 1 

I F ( KOUNT (  I )  .  EQ  .  1  )  C { M , L )  =  C ( M, L )  +  1 

125  CONTINUE 
130  CONTINUE 
GOTO  109 
C 

c 

150 

151 

e 

C  FIND  INVERSE  OF  MATRIX  MAT ( M , N ) 

CALI  F I NVER ( NN , MAT , I N VERS , RHS , XX , BB,RR, PIVOT ) 

DO  444  I=1,NA,2 

WRITE (6,2  001  1 , KEY  BAR (  I  ) ,KEY8 ARM  +  1 ) ,KLBR,WTT(  I ) ,KRBR 


PR  I  NT  C  (I  ,  J  ) 

1=  1 

WRITE (6, 151  )  ( (  I  t J  ,C(  I  ,  J)  , J=1,NNJ , 1  =  1, NN) 

FORMAT (  *  0 1  , 10X , 31 4 , 1 0  X  , 31 4 , 10X , 3 1  4 , 10X, 3 1 4 , 1  OX , 3 1  4 , 10X ) 


*'r. 


■ 
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200  F0RMATI/5X,  I2,1X,2A4,  IX, Al,  ,WT=>  ,F10.5,  Al) 

444  CONTINUE 

C 

C  FIND  RESPONSE  TO  EACH  HYPOTHETICAL  DOCUMENT 

DO  18  5  L= 1 »  KAPM 
DO  170  1=1, NUM 
K CUNT  m  =  0 
170  CONTINUE 

KK=NOTERM (L }*2-l 
DO  175  I  =  1 1  KK,  2 
DO  174  J= 1 , MUM  1 , 2 

I F ( KE YWD( I »  L ) ♦ NE • KEYS AR ( J } )  GOTO  174 
IE( KEYWDt 1+1 .,L) .NE. KEYBAR! J+1I)  GOTO  174 
KOUNT ( J }=1 

174  CONTINUE 

175  CONTINUE 
RES  P=0 . 

DO  178  1=1, NA, 2 
RESP=RF SP+WTT ( I )*KOUNT{ I } 

178  CONTINUE 

WRITE ( 6 ,1 30 }  RESP 

180  FORMAT l /5  X* *  RESPONSE  TO  A  HYPOTHETICAL  DOCU  I S * , F 16 .8, IX, M EX  ACT 
U  SOLUTION)*} 

185  CONTINUE 

FIND  RESPONSE  ^0  EACH  DOCUMENT  IN  TAPE 
REWIND  MAG4 
KK=  1 

300  RES  P-0 . 

201  DO  202  1=1, NUM 
KOUNT (I)=0 

202  CONTINUE 

READ  ONE  TITILF  FROM  TAPE 
READ(MAG4>N,  (  IT  IT  LI  I)  ,  I=1,N) 

IF  (M.EQ.4)  GOTO  250 

COMPARE  ITITL  WITH  KEYBAR,  INCREMENT  KOUNT(I) 

N 1=N— 1 

DO  210  1=5, Ml, 2 
DO  207  J=1,NA,2 

I F ( I T I TLI I )  . NE . KEYBAR { J ) )  GOTO  207 
IF{  ITITL(  1  +  1)  .NE.KEYB  ARU  +  1)  )  GOTO  207 
KOUNT ( J ) = 1 
207  CONTINUE 
210  CONTINUE 

DO  215  J=1,NA,2 
R F  SP  =  RE SP  +  WTT ( J ) *KOUN T ( J ) 

215  CONTINUE 

IEIRESP.LE.O. )  GOTO  3  00 

C  ARRANGE  OUTPUT  IN  DESCENDING  ORDER  OF  R { N ) 

STORE ( KK ) =R  ESP 
I TEM { KK , 1 )=N 
DO  292  1=1, N 
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ITEM(KK,I  +  1 )  =  I T ITL  (  I  ) 

292  CONTINUE 

IF  (KK.EQ.l )  GOTO  205 
LK-KK 

294  IE(STORF{LK).LE. STORE (LK-1)  )  GOTO  296 

IF (  ITEM (LK,  1) .GT.  ITEM (LK-1, 1)  }  GOTO  2  82 

281  NN= I TEM ( L  K-l , 1 ) 

GOTO  284 

282  NN=TTEM(LK| 1) 

284  DO  28  3  L  =  1 »  N  N 
ITEMP=ITFM( LK  j  L ) 

ITEM! LK,L)=  ITEM  (LK-1 »  L  ) 

ITEM! LK-1 t  L ) - 1 T  E  M  P 

283  CONTINUE 
TFMP= ST  ORE ( LK ) 

STORE { LK )= STORE ( LK-1 ) 

STORE { LK-  1J=TEMP 
29  5  L  K=  LK-  1 

IF(LK.NE.l)  GOTO  294 
296  IF{ KK.EQ.  100)  GOTO  300 

285  KK=KK+ 1 
GOTO  300 

250  WRITE (6 , 251 ) 

251  FORMAT! ' 1 1 //, 12X, ****  TITILES  RETRIEVED  *#**/) 
DO  288  KK=1 t 100 

WRITE (6,287)  STORE! KK ) 

287  FORMAT! /16X,'R(N)-5 ,F12.7) 

M= ITEM { KK , 1 ) 

W R I T E ( 6 , 2 39 )  I  I TEM I KK , LL ) ,LL=2»M) 

239  FORMAT { 12  X, 16A4 / ( 28X» 12A4 ) ) 

288  CONTINUE 

290  WRITE  16,291) 

291  FORMAT! /12X , ’ ***  SEARCH  COMPLETED  ****) 

REWIND  MAG4 

GOTO  10 
556  STOP 
END 


■■ 
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THIS  ROUTINE  COMPUTES  THE  INVERSE  OF  A  MATRIX  BY 
CALLING  A  LIBRARY  SUBROUTINE  CS011A 
C 

SUBROUT INE  F INVER ( N , M AT , I NVERS  , RHS T XX ,  PB, RR , P I VOT ) 

REAL*  4  MATIN, N)  , INVERSION)  ,RHS  (N ) ,XX I N ) , BB (N, N )  ,RR(N) 

INTEGER  PIVOT  IN) ,C(  10  0,  100) 

■COMMON  VJTTI200) , IWTI200),C 
DO  60  1.  1=1,  N 
DO  60  0  J  =  1 , N 
MAT (I , J)=C( I, J) 

600  CONTINUE 

601  CONTINUE 

CALL  CS 0 1 1 A l M AT , I N VER  S  , RH  S , XX , N , B B , RR , P I VOT ) 

WRITE (6,602 )(!I,J , INVFRSt I , J) , J=1 ?N) , 1=1, N) 

602  FORMAT ( /5X, ’THE  FOLLOWING  ARE  ELEMENTS  OF  THE  INVERSE  OF  MATRIX 
UCCIjJl’ /(4( 5X,2I4,F16 .8) ) ) 

C  FIND  EXECT  NTS  USING  EQUATION  10 
NA=2*N 

DO  190  1=1, NA, 2 
WTT( I  3  =  0. 

190  CONTINUE 

WRITE (6,1993 

199  FORMAT {/5X,  ’THE  FOLLOWING  ARE  WTS  BY  EXACT  SOLUTION:5) 

DO  195  1=1, NA, 2 
M=I /2+1 

DO  194  J- 1 , N A, 2 
L=J/2+l 

WTT ( I )  =  WT  T (  I }+ INVER SI M , L ) * I WT { J ) 

194  CONTINUE 

195  CONTINUE 
RETURN 
END 
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***  CROSS-CORRELATED  search  *** 

DIMENSION  ZB AR ( 100) , STORE! 100 ) , ITEM(100,53) 

DIMENSION  YBAR( 100) ,T( 100) 

D I MENS I  ON  NOTERM { 20 )  , KEYWDI 30,12)  ,NUMI  (  20)  , KOUNT { 2 00 ) , 1 TI  TL (1  00 ) , 
IK  EY  BAR ( 100) ,C(  100,  100 ), IWT{ 100) 

DATA  KBLAN/ '  * / , KZ ZZZ/ * Z ZZ Z ’ / ,  KLBR/ »  (  ' / , KR  BR / * )  «/, 

1KC0M/S  «/ 

READ  KEYWORDS  OF  QUESTION 
10  READ! 5, 21 , END=556 )  KA PM, I  TRACE 
21  FORMAT (214) 

NUM=0 

DO  35  M=1,KAPM 
I  1  =  1 

24  12=11+25 

READ! 5, 25 ) ( KEYWOI I,M) , 1=1 1, 12) 

25  FORMAT { 13 { A4, A2 ) ) 

13=12-1 

DO  28  1  =  1  1,13,2 

IFCKEYWDI I»M) .EQ. KBLAN)  GOTO  30 
28  CONTINUE 
I  1=11+26 
GOTO  24 
30  K= I - 1 

WRITE  16,333) 

333  FORMAT! /5X,  *  THE  FOLLOWING  ARE  KEYWORDS  FROM  A  QUESTION*) 

WRITE { 6 , 334 ) ( KEYWDl II ?M),II=I1,K) 

334  FORMAT  U5X,2A4) 

NOTERM ( M) =K/2  • 

WR ITE 1 6 , 5  55 )  NOTERM { M ) 

555  FORMAT  (5X,,NO.  OF  TERMS=*,I2) 

NUMI ( M) =1-1 
NUM= NUM+NUM I(M) 

35  CONTINUE 

IFINUM.LF .200)  GOTO  37 
WRITE (6,36) 

36  FORMAT! 5X , ’  MORE  THAN  100  KEYWORDS  READ  *) 

STOP 

37  IF  1  ITRACE.NE. 1 )  GOTO  40 
WRITE (6,38)  KAPM  - 

38  FORMAT ( 5X , 1  KAPM=*,I4) 

WRITE (6,39)  HUM 

39  F OR MAT ( 5X , * NU M= ’,14/) 

COMBINE  KEYWORDS  TO  FORM  KEYBAR(I)  IN 
ALPHABFT I C  ORDER  WITH  WEIGHTS  IWT(I) 

FIRST  MERGE  KEYWORDS 

40  11=1 

44  DO  50  M=1 , K ARM 
12=11 +NUM I ( M ) - 1 
DC  48  1=11,12 
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KEY BAR (  I ) =KEYWD ( I -1 1  +  1 ,M) 

43  CONTINUE 
T  1=12+1 
50  CONTINUE 
NUM=I  1-  1 

C  SET  IWT(I)=1  FOR  1=1, NUN- 1  IN  STEPS  OF  2 

J=NUM-1 
00  55  1=1 , J , 2 
IWT(  I )=1 
55  CONTINUE 

BUBBLE  SORT  KEY  BAR { I ) 

DO  78  K=1,J,2 
I =NUH— 1 
L  A  S  T  =  0 

62  IF { KEYBAR (I)  -  KEY BAR { 1-2 ) I  66,64,76 
64  I F { K  E Y  B  AR (  1  +  1 } - K  E  YB  AR ( I  —  1 )  )  66,70,76 
66  I T=KE YB AR { I ) 

KEYBAR! I ) =KEYB AR ( 1-2) 

KEYBAR { I  — 2  >  = IT 
I T= KEYBAR { I +1 ) 

KEYBAR ( I +  I } =KEY8AR { 1-1) 

KEYBAR ( 1-1 )= IT 
I T= I WT (  I  ) 

IWT( I  )=  I  WT (  1-2) 

IWT{ 1-2 )=IT 
L  A  ST=  1 
GOTO  76 

70  IWT ( I-? )=IWT{  I  - 2 )  +  I WT {  I) 

I WT ( I  1  =  0 
KEYBAR {  I  )  =  K  Z  Z  Z  2 
KEYBAR  (  I+D-K3LAN 
L  AST=  1 

76  1=1-2 

I F{ I. GT.K )  GOTO  62 
I F { LA  ST .E  Q. 0)  GOTO  80 
7 8  CONTINUE 
C 

C  ELIMINATE  DUPLICATE  KEYWORDS 

DO  79  1  =  1 , J , 2 

IF  (KEYBAR! I) .EQ.KZZZZ)  GOTO  77 
7°  CONTINUE 
GOTO  30 

77  NUM= I —I 

30  IF(  ITRACF.NE.l )  GOTO  100 
WRITE (6,39)  NUN 
NA=NUM— 1 

WRITE  (6,90)  (  I  ,  KEYBAR  {  I  )  ,  KE  YB  AR  ( I  + 1 }  ,  KL  BR  ,  I  WT  (  I)  ,  KRBR  ,  KCO'M  , 

A  I =1 , NA , 2 ) 

90  FORMAT  ( 5 ( 5 X , I  3 , 1 X , 2A4 , IX , A 1 , 1 2 » 2 Al ) / ) 

C 

C  SET  COUNTER  CU,J)  =  0 

NK=NUM/2 
NUM  1=NIJM~  1 
NN=NK*«2 


. 
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100  DO  102  1=1, NN 
YB AR ( I )=0. 

DC  101  J=1,NN 
C  (  I ,  J  )  =  0. 

101  CONTINUE 

102  CONTINUE 

C 

C  FIND  YBAR  IN  TERMS  OF  KEYBAR 

DO  760  L  M  =  1 r K A P M 
DO  705  1=1 , MUM, 2 
KOUNTU  )  =  0 
705  CONTINUE 

K  K  -  NO  TE  RM { L  M ) *  2-1 
DO  715  J= 1, KK,2 
DO  714  M=1,NUM1,2 

IF(KEYWD( J,LM) .ME. KEY  BAR (Ml  )  GOTO  714 
IF(KEYWD(  J  +  1,LM).NE.KF:YBAR(M+1)  }  GOTO  714 
KCUNT ( M )  —  1 

714  CONTINUE 

715  CONTINUE 

DO  160  L=1,NK 
LL=L*2- 1 

I F! KOUNT ( LL ) . EQ.O )  GOTO  160 
DO  162  M= 1 , NK 
MM=M*2- 1 

IFC KOUNT (MM) .EQ.O )  GOTO  162 
K=l L-l ) *NK+M 
IF(L.EQ.M)  GOTO  165 

YB  AR { K ) =Y BAR { K } +K  OUN T ( LL } *K OU  NT ( M M } / 2  * 

GOTO  742 

165  YR  AR.  (  K )  =  YBAR  !  K  )  +KOUNT  ILL) 

742  W RI TE ( 6 , 7 50 } K , L ,  M , YB AR ! K ) 

7 50  FORMAT! 5X , » K= 1  , 1 3  ,2X , 1  1  =  '  » I  3 , 2X , *  J= ' , 1 3 , 2X , F 1 0 . 5 } 

162  CONTINUE 
160  CONTINUE 
760  CONTINUE 

WRITE (6 ,765  }  t  K , Y  BAR ! K ) ,K=l,NN) 

765  FORMAT! ’O’ ,5{8X,I4,2X,F7.3) ) 

S  FT  COUNTER  KOUNT !T }  =  0 
MAG 4= 3 

100  DO  103  1=1, NUN 
KCUNT { I )=0 

103  CONTINUE 
DO  168  1=1, NN 
Z  B  A  R (  I )=0 . 

163  CONTINUE 

READ  ONE  TITHE  FROM  TAPE 
1 10  READ! M A G4 } N , ( IT ITL ( I )  , 1  =  1 , N } 

IF(N.EQ.A)  GOTO  150 

COMPARE  ITITL  WITH  KEYBAR 
INCREMENT  KOUNT (I) 

I  FLAG-0 
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112  N  1  =  N— 1 

DO  120  1=5, Nl, 2 
DO  115  J= 1 , MUM  1 r  2 

IFIITITU  I)  .NE.KEYBAR ( J)  )  GOTO  115 
TFUTITK  1+1)  .  OF  .  KEYB  AR  (  J  +  l )  )  GOTO  115 
KOUNT ( J )=1 
I FL AG= I FL  AG+1 
GOTO  120 
115  CONTINUE 
120  CONTINUE 

IF  UFLAG.LE  .0)  GOTO  110 
C 

C,  FIND  Z  (  K  )  IN  TERMS  OF  Z  {  I  ,  J  ) 

DO  172  1=1, NK 
11=1*2-1 

I E ( KOUNT (  II  I  oEQ.O  )  GOTO  172 
DO  175  J=1,NK 
J J= J*2- 1 

I F ( KOUNT { J J ) . EQ . 0  I  GOTO  175 
K= { 1-1 )*NK+J 
I F ( I . EQ « J }  GOTO  178 
Z  P  AR ( K) =KQUNT{ I  I ) *KOUNT { J  J ) / 2 . 

GOTO  175 

178  ZBARIK1=K0UNTU  I  ) 

175  CONTINUE 
172  CONTINUE 

DO  180  M= 1 »  NN 

IF (ABS! ZBAR(M) ) .LE. 0.0001 )  GOTO  180 
DO  185  N=1,NN 

I F ( ABS ( ZB AR ( N ) ) .LE.0.00015  GOTO  185 
C ( M , N ) =C ( M, N) +ZBA  R { M ) *ZBAR i U ) 

185  CONTINUE 
130  CONTINUE 
GOTO  109 
150  SQM=0. 

DO  230  1=1, NN 

IF! ABSI C(  I,  I  )) .LE. 0.0001)  GOTO  230 

aa=ybar  m/cn ,  I) 

A  B  =  0 . 

DO  220  J=1,NN 

IF! J.EQ.I )  GOTO  220 

IF! ABS! C( J, J) ) .LE  .0.0001 )  GOTO  220 

AB=AB+C {  I , J ) *YB  AR  ( J ) / C ( I , I ) *C ! J , J ) 

220  CONTINUE 

T! I  1  =  AA-AB 

WRITE! 6, 225  )  I,T(  I) 

225  FORMAT ( /5  X,  1 WT- EL  EM  ENT ( 1 ,12,'  >  =  ’ ,F16.5) 

C 

C.  FIND  MEAN  SQUARE  OF  VI HOLE  COLLECTION 

SQM  =  SQM+T { I  )*YBAR  !  I ) 

230  CONTINUE 

WRITE (6, 190)  SQM 

)  oq  FORMAT  ( /5X,  1  MEAN  SQUARE  VALiJE  OF  COLLECTION  :*,F16.5> 
REWIND  MAG4 
KK=  1 


336  on  33  3  1=1, NUM ,  2 
KGUNT ( I ) =0 

338  CONTI  MUG 

DO  339  J= 1 , NN 
Z  BAR ( J ) =0 . 

339  CONTINUE 

341  READ! MAG4 ) N , U T  ITL M ) ,  1  =  1 , N ) 

I F ( N*  EQ .4 )  GOTO  400 

837  I FL AG=0 

DO  340  1  =  5 , N ,  2 
DO  342  J=1,NUM1,2 

T F( IT  ITL {  I )  .NF.KEYBAR { J))  GOTO  342 
I F < IT ITL ( 1  +  1) . NE.KEYBARCJ+1) )  GOTO  342 
KGUNT ( J ) = 1 
I FL AG= I  EL AG  +  1 
GOTO  340 

342  CONTINUE 

340  CONTINUE 

I F ( I F LAG . LE , 0 )  GOTO  34  1 

FIND  ZBAR(R)  IN  TERMS  0^  Z(I,J) 

DO  350  1=1, NK  . 

I  I=I*2-1 

I F ( KGUNT (  II  I . EQ .0  )  GOT 0  350 
DO  355  J=1,NK 
J J=J*2- 1 

IF { KOUNT ( JJ ) o  EQ.O )  GOTO  3  55 
K=(  I-1)*NK+J 
I F (  I  - EQ . J I  GOTO  353 
Z  BAR { K ) =KOU  NT { I  I) *KGU  NT { J  J ) / 2  * 

GOTO  355 

3  53  Z  BAR  {  K )  =K01JNT  (II) 

355  CONTINUE 
350  CONTINUE 
R  E  S  P  =  0  . 

DO  378  1=1, NN 

IF ( ARS { ZBAR ( I )) .LE. 0.0001)  GOTO  37  3 
RFSP=RESP+T ( I ) * ZBAR ( I ) 

378  CONTINUE 

I F ( ABS ( RFSP ) . LF .0 .001) GOTO  336 
T  S=RE  SP/SQM 

ARRANGE  OUTPUT  IN  DESCENDING  ORDER  OF  R I N ) 

STORE (KK)=TS 

I T  F  M { K  K , 1 )=N 

DO  380  1=1, N 

ITEM!  KK  ,1+1  )  =  IT  ITU  I  ) 

380  CONTINUE 

IF(KK.EQ.l)  GOTO  385 
L  K  =  KK 

394  IF ( STORE! LK ) .LE . STORE ( LK-1 ) )  GOTO  396 

I F ( I T  E  M ( L  K  t 1 )  . GT  .  I T E  M { LK- 1 , 1 ) )  GOTO  38  2 

M  M=  I  T  E  M  (  L  K—  1  ,1  ) 

GOTO  384 

382  3M= ITEM (LK, 1 ) 


■ 


. 
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384  DC  383  L= 1 , MM 

I  TEMP- ITEM  (LK,L) 

ITEM ( LK , L )= I TEM ( L  K- 1 , L ) 

ITEM( LK-1 ,L )= I  TEMP 
333  CONTINUE 

TEMP- STORE! LK) 

STORE (LK)-STQRE(LK-l) 

STORE ( l K- 1 ) -TEMP 
LK-LK-1 

IE! LK.NE. I )  GOTO  394 
396  IFUK.EQ.  100)  GOTO  386 

385  KK-KK+1 
GOTO  336 

336  KK- 100 
GOTO  336 

400  00  388  KK- I ,100 
M-I TEM! KK  ,  1  ) 

WRITE (6,387)  STORE (KK) 

387  FORMAT! /5X, ’RESPONSE  RATIO  RESP/S QM- * , F 15 . 7 ) 
WRITE (6,389)  ( ITEM! KK, LI ) , LL  =  2»M) 

38°  FORMAT! IX, 33A4/) 

308  CONTINUE 

390  WRITE (6,391) 

391  FORMAT! 5X, ' ***  SEARCH  COMPLETED  ***•) 

REWIND  MAG 4 

GOTO  10 
556  STOP 
END 
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C 
C 

c 

C  ***  LEAST  SQUARE  SEARCH  *** 

C 

RPAL  MAT( 20,20) ,1 NVERS (20,20)  ,RHS ( 20 ) , BB ( 20 , 20 ) , RR ( 20 )  ,  XX  (  20 ) 
INTEGER  PIVOT (20) 

COMMON  V(100) ,WTT (200 ) ,U(50,50) 

0  I MFMSI ON  F { 1 5  )  , NUM I (  15 ) , KOUNT ( 15,50)  , NOTERM ( 20 )  , 

IK  CUNT  1  (  10  0)  ,  I  TITL  (200  )  ,STORE(  100)  ,ITEM(  100,50)  , KEY  V)D(  20,40)  , 
2K  EYBAR ( 20  0) 

DATA  KBLAN/ ’  1 / , KZ 11 1/ • ZZZZ 5 /, KCOM/ 1 ,  */ 

10  READ (5, 2, END=555)  KAPM 

2  FORMAT ( 14 ) 

NUM=0 

DO  25  M=1,KAPM 
C 

C  READ  IN  THE  TITILES  OF  QUESTION 

C 

READ ( 5,3)  F ( M ) 

3  FORMAT ( F4 . 0 ) 

I  1=1 

4  12=11+25 

READ ( 5,8) { K  EYWD ( I ,M) , 1=1 1,12) 

8  FORMAT ( 13 ( A4 , A2 ) ) 

I  3=1  2-1 

DO  12  1=11,13,2 

I F(KEYWD( I ,H) .EQ. KBLAN)  GOTO  20 
12  CONTINUE 
I  1=11+26 
GOTO  4 

20  K= I— 1 
WRITE (6,21  ) 

21  FORMAT ( 15X, *  THE  FOLLOWING  ARE  KEYWORDS  FROM  A  DOCU 1  ) 

WR ITE ( 6 , 2 2 ) ( KE YWD (I , M  5  ,  I =  1 1 ,  K  ) 

22  FORMAT ( 15 X, 2A4 ) 

WRITE (6,23)  F ( M ) 

23  FORMAT  (  15X ,.  '  F{M)  =  !,F5.2/) 

NUM I (M)=K 

NUM=  NUM+NUM I (M) 

NOTERM  ( ,M)  =K/2 
25  CONTINUE 

WRITE (6, 28)  NUM 
28  FORMAT ( 5X , 1 NUM= % 14/ ) 

C 

C  COMBINE  KEYWORDS  TO  FORM  KEYBAR  IN  ALPHABETIC 

C  ORDER, AND  MERGE  THE  KEYWORDS 

C 

I  1=1 

DC  35  M=1 , KAPM 

12=11  +  NUM I { M3  —  1 

DO  30  1=11,12 

K  E Y  BAR (I) =K  EYWD ( I ~ I  1  +  1  , M ) 

30  CONTINUE 
I  1=1  2+1 


■ 


. 


n  n 


35  CONTINUE 
NUM=  I  1.-1 
J=NUM-1 
C 

C  RUBBLE  SORT  KEYBAR! I) 

00  50  K=1,J,2 
I =NUM- 1 
LASTED 

AO  I  E  (  KEY  BAR  (I  )  -KE  YB  AR.  (I  -  2  )  )  42,44,46 
AA  IE( KEYBARI 1+1 l-KEYRAR ( 1-1) )  42,48,46 
42  IT=KEYBAR(I) 

KEY BAR { I ) =  KEYB  AR (  1-2) 

KEY  BAR ( 1-2 ) = IT 
I T=KEYB AR ( 1+1 ) 

KEYBAR ( 1+ 1)=KEY3AR{  I  -  1  ) 

KEY  BAR ( I- 1 ) = IT 
L AST= 1 
GOTO  46 

48  KEYBAR { I )=KZZZZ 
KEYBAR ( 1+1) =KBLAN 
LAST-  1 
46  1=1-2 

I  F( I . GT.K )  GOTO  4  0 
I  FI  LA  ST  *  EQ . 0 )  GOTO  52 

50  CONTINUE 

DO  51  I=1,J,2 

I F I KEYBAR ( I  ).EQ*KZZZZ)  GOTO  53 

51  CONTINUE 
GOTO  52 

53  NU M= I  —  1 

52  'WRITE  (6,2  3)  MUM 
NA=NUM- 1 

WRITE (6,54) ( ( I , KEYBAR ( I ), KEYBAR (I+l),KC0M),I=ltNA,2) 

54  FORMAT { 6( 5X,I2, 1X,2A4, IX, A1 )/ ) 

S  FT  U<I,J)  =  0,  U  IS  THE  CROSS  CORRELATION, 

C.  SET  VII )=Q,  V  IS  MEAN  ESTIMATED  WT  OF  ITH  WORD 

DO  60  1=1, NUM , 2 
DO  58  J=1 , NUM , 2 
U  (  I  ,  J  )  =  0 . 

58  CONTINUE 
V  (  I  )  =  0. 

60  CONTINUE 

00  63  N=1 , KAPM 
no  62  M=1 , NUM, 2 
KOUNT { N , M ) =0 

62  CONTINUE 

63  CONTINUE 

no  70  L  =  1 , K A P M 
KK=NOTFRM  (L  )  *2.-1 
no  67  1=1  , KK,  2 
DO  68  J  =  1 , N  A , 2 

IF( KEYWDII , L ) .NE. KEYB AR{ J  )  )  GOTO  68 
IF ( KEYWD( 1+ 1 ,L ) .ME* KEYBAR (J+l ) )  GOTO  68 
KOUNT ( L , J ) = 1 
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. 


on  non 


6 a  CONTINUE 
67  CONTINUE 
70  CONTINUE 

DO  75  I  =  1 » N  A ,  2 
DO  73  L  =  1 ,  K  A  P  M 

V (  I  )  =  V{  I  )+F(L)*K0UNT(Lf  I  ) 

7 3  CONTINUE 

75  CONTINUE 
DO  80  I =1 1 N A  t  2 
DO  78  J=lfNA,2 
DO  76  L  =1  ,  KAP M 

U ( I »  J ) =  U (  I , J) +  KOUNTU 1 1 )*K0UNT( L, J> 

76  CONTINUE 
78  CONTINUE 
80  CONTINUE 

WRITE (6,90)  {  V  C I ) ,  1  =  1, NA, 2) 

90  FORMAT (  4X  , ! THE  FOLLOWING  ARE  VII)’  /  {4X,5F16.8)) 
WRITE (6,91)  {  (  I  ,  J,U!I  ,  J)  ,  J=1  ,NA,  2)  ,  1=1  ,NA,2) 

93.  FORMAT! /5X,  ‘THE  FOLLOWING  ARE  U  ( I  ,  J  )  *  /  {  4!  5X  ,  2  14,  F  16  .8  ) 

FIND  INVERSE  OF  MATRIX  MAT ( M,  N  3 

N=NUM/2 

CALL  FI NV  ER (  N  A ,  N , HA T , I N VE RS , RH S , XX , B8 , R R , P I VOT ) 

DO  444  1=1, NA, 2 

WRITE (6,200)1  , KEY  BAR (  I } , KEY  BAR { 1  +  1) , K  LBR , WTT ( I )  ,KRBR 

200  FORMAT { /5X,  12, IX, 2A4,  IX, A  1,  SWT=  * , F10. 5, A1 ) 

444  CONTINUE 

FIND  RESPONSE  TO  EACH  DOCUMENT 
C 

MAG4= 3 
KK=  1 

300  RESP=0. 

201  DO  202  1=1, NUM 
KOUNTI ( I )=0 

202  CONTINUE 
C 

C.  READ  ONE  TITILF  FROM  TAPE 

READ ( MAG4 )  N,  {  IT  ITL (  I  )  ,  1  =  1 , N ) 

IF  (N.EQ.4)  GOTO  250 
C 

c.  COMPARE  ITITL  WITH  KEYBAR  ,  INCREMENT  KOUNT (  I  ) 

N  1=  N-  1 

DO  210  1=5, N I, 2 
N0M1=NUM- I 
DO  207  J=1,NUM1,2 

I  F(  IT  ITU  I  )  .NE.  KEYBAR  (  J  )  )  GOTO  20  7 
I E ( I T I T L (  I  +  1 )  . ME. KEYBAR (J  +  l) )  GOTO  207 
KOUNT If J ) =1 
G  OT  0  2 1  0 
207  CONTINUE 
210  CONTINUE 

DP  215  J=  1 , NUM 1 , 2 

R.  E  SP=  RE  SP  +  WTT  {  J  )*K0UNT1(  J) 
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CONTINUE 

IHRFSP.LT. 0.  )  GOTO  300 
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c 

C  ARRANGE  OUTPUT  IN  DESCENDING  ORDER  OF  FUN) 

STORE (KK)=R ESP 
I TEM { KK , 1 )  =  N 
DO  392  1=1,  N 
ITEM(KK,I+1 )  =  T  T  I T  L  ( I) 

292  CONTINUE 

IF  (KK.EQ.l)  GOTO  285 
LK=KK 

294  IF ( STORE ( LK  )  . LE  .  STORE  ILK- 1 ) )  GOTO  296 

T  F ( ITEMtLK,  1 ) -I TE Ml LK- l , I ) )  281  ,282,282 

281  NN=ITEM  (l.K-1,  1  ) 

GOTO  284 

282  NN= ITEMtLK, 1) 

284  DC  283  L= 1 , NN 

I TEMP= I TE  M( LK, L ) 

ITEM{ LK,L >= ITEM (LK-1, L) 

ITEMt LK-1 ,L )= ITEMP 

283  CONTINUE 
TEMP=STOREt LK } 

STORE  ILK) -STORE ( LK-1 ) 

STORE ( LK- 1) -TEMP 

295  IK- LK-1 

IFt LK.NE. 1)  GOTO  294 

296  ITt KK.FQ. 100)  GOTO  286 

285  KK-KK+1 
GOTO  300 

286  KK- 100 
GOTO  300 

250  DC  238  KK=1,100 
M=I TEH ( KK , 1 ) 

WRITE (6,287)  STORE ( KK ) 

287  FORMAT! / 5 X ,  *  RE  SP-f  , F 1 5 .7 ) 

WRITE! 6,289  )  {  I TEM { KK , LL ) ,LL=2,M) 

289  FORMAT! IX ,3 3A4/ 5 

288  CONTINUE 

290  WRITE  (6,291) 

291  FORMAT! 5X ,' ***  SEARCH  COMPLETED***1) 
REWIND  MAG 4 

GOTO  10 
555  STOP 
END 


' 


°  <“>  oo  ooooo 


91 


2-PARAMETER  SEARCH  *** 


D I  MEN  SI  ON  KEYWD (40 , 2 )  , KT 1 ( 40}  , KT2 (40)  , ITITL ( 100) 
REAL.  MAT  {  20 , 2  0  )  ,  I  NVER  S  (  20 , 2  0  )  ,RHS  (20)  ,BS(  20 ,20)  , 
INTEGER  PIVOT ( 20)  ,0(20,20) ,  ITEM!  100, 56)  ,  KBL AN / ' 
REAL  VI  (  20)  , V2 ( 20  )  ,  STORE-  (  1Q0  ) 


, RR  (  20 )  , XX { 20 ) 

'/ 


READ  KEYWORDS  IN  THE  2  VECTORS 


I  DC  20  M =1,2 
I  1-1 


7  FORM  AT (  1 3 { A 4 , A 2  )  ) 

DO  8  1-11,12,2 

IF(KEYWD(  I  ,M)  .EQ.  K8LAN)  GOTO  10 

8  CONTINUE 
I  1=11+26 
GOTO  2 

K,MJM  INDICATING  THE  LAST  PART  OF  LAST  WORD 
10  K  =  I  —  1 

WRITE (6,5) 

5  FORMAT! /5X,  1  KEYWORDS  OF  AN  INPUT  VECTOR’/) 
WRI TE ( 6 , 1 2 )  (  K EYWD-l  I  ,  M  )  ,  I  =  I  1  ,  K ) 

12  FORMAT (/5X, 2A4) 

NUM=K 

NTERM-K/2 

WRITE (6 ,15)  NTERM 

15  FORMAT ( /5X,  1  NO.  OF  TERMS  -’,14/) 

20  CONTINUE 

DC  444  1-1,20 
DC  44  3  J  =  1 , 2 0 
C  (  I  ,  J  )  =  0 

443  CONTINUE 

444  CONTINUE 


Li  U  — 

GCTO  45 

99  WRITE (6, 199) 

199  FORMAT! 5X  , »***  ERROR  ***') 


INITIALISE  KT 1 , KT 2 
43  DC  48  I  =  1 ,  NU.M  ,  2 


KT 1 ! I )=0 
KT2 ( I )=0 


48  CONTINUE 


C 

c 


READ  ONE  T I  TILE  FROM  TAPE 
NA-NUM-  1 

READ!  MAG,  ERP-99  )  N,  (ITITL  (  I  ),  I=1,N) 
I  F  (  N .  E  Q  •  4  )  GCTO  5  00 


C 

C 


C CM PA  RE  ITITL  WITH  KEYWD! I , M ) 
N  1=N- 1 

DC  70  K= 5 , N  1 , 2 


. 
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DC  68  I =1 , N A, 2 

I F { IT  I T L ( K)  .NE .KEYWD{  I , 1 )  )  GOTO  93 
IF  (IT  ITU  K+l)  .  NE.  KEYWDI  1  +  1,  1)  )  GOTO  5  3 
KT1 (1)=1 

53  IFIITITL(K) .NE.KEYWCI 1,2) )  GOTO  68 

I  FI  IT  ITU  K+l)  .NE.KEYWCl  1  +  1,2)  )  GOTO  6  3 
K  T  2  (  I  )  =  1 
63  CCNTTNUE 
70  CONTINUE 
C 

C  INCREMENT  C(I,J) 

DO  80  1=1, NUM, 2 
L=I/2+l 

IFIKTl(I).iNE.ll  G  CTO  80 
DO  7  5  J  =  1 , NUM »  2 
M=J/2+l 

IF ( KT  2 ( J )  .NE. 1 >  GOTO  75 
C  {  L  ,  M  )  =  C  (  L  ,  P  )  +  1 
75  CCNTINUE 
80  CONTINUE 
GCT 0  45 

500  WR I TE ( 6 , 3  5 )  { { I , J , C ( I , J  )  ,  J=1 , NTERM ) , I  =  I , NTERM ) 

85  FORMAT (  /(  10 X, 314)  ) 

FIND  INVERSE  OF  C,  MAT  IS  C  IN  REAL  MODE 
CALL  F I NV  A ( NTERM  j MAT,  INVERS  ,RHS , X X , BR , RR , P I VOT , C , V 1 , V2 ) 
S  L  M  V 1  =  0  . 

DC  106  1=1, NTERM 
SUMV1=SUMVI+VU  I  ) 

106  CCNTINUE 
WRITE (6t1G9)  SUMV1 

109  FORMAT  (  /5X,  '  SUKV1=  *  ,  F  16.6  ) 

C 

C  FIND  RESPONSE  TO  A  GENENAL  DOCUMENT 

C  READ  A  TITLE  AND  CETERMIME  ITS  KEYWORDS 

C  WITH  RESPECT  TO  PI  AND  P2 

REWIND  MAG 
NC  =  0 
KK=1 

100  RESP=0. 

A  1=0. 

A  2=0. 

DO  107  1=1, NUM, 2 
KT1  (I  )  =  0 
K  T  2 (I  )  =  0 

107  CCNTINUE 

R  E  A  0  (  MAG)  N,  (  IT  ITU  I  )  ,  1  =  1, N) 

I  F ( N . E Q • 4 )  GOTO  6  00 
N  1  =  N-1 

DC  115  K  =  5 , N1 , 2 
DC  110  I =  1 , N A , 2 

I F ( IT ITL ( K) .NE .KEYWDI I , 1 ) )  GOTO  103 
I  F  (  IT  ITU  K+  1 )  .  NE.  KEYWD  (1  +  1,1))  GOT 0  '  1  03 
K  T  1  {  I  )  =  1 

108  IF ( IT  ITU K)  .NE. KEYWDI  I ,2)  )  GOTO  110 


- 


. 

' 
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I  r  (  I T  I T  L  (  K+  1 )  .  N E  .  K E  Y W  D  {  I  + 1 ,  2  )  )  GDT  □  110 
K  T  2  ( I  )  -  1 
110  CONTINUE 
113  CONTINUE 

DC  120  1=1, N A, 2 
M=I/2+l 

A 1-A1+V1 ( M) *KT1 ( I  ) 

A  2= A2  +  Y  2 ( M ) *  K  T  2 ( I  ) 

120  CONTINUE 

I F { ABS{ A1 }. LE • C .00001 .GR. ABS ( A2 ). LE. 0 .00001 )  GUT  0  100 
W  R  I T  E  (  6 ,  1  2  1  )  A  1 ,  A  2 

121  FORMAT !/5X,  'Al='  , F16.6,  10 X,  ' A2=* , FI 0.6) 

NG-NO+l 

R  E  S  ?  =  A 1  v  A  2  /  SU  M V  1 


ARRANGE  OUTPUT  IN  DESCENDING  ORDER  OF  R { N ) 
STORE! KK}- RES? 

ITEMl KK, 1 )=N 
DC  130  1  =  1, N 
ITEM  KK  ,  I  +1  )=  I  T  I TL  (  i) 

130  CONTINUE 

IF(KK.EQ.l)  GOTO  152 

lk-kk 

132  I E { A B S ( S  T  0  R  E  ( L  K  }  )  .L E. ABS { STORE { LK-  1 ) )  )GOTO 
I F { ITEMILK, 1) .GT.  ITEM(LK-1, 1)  )  GOTO  140 
137  NN= ITEM  ILK- 1,1) 

GCTO  142 

140  NN=ITEM(LK,  1) 

142  DC  145  L=1,NN 

ITEMP  =  ITEM[ IK , L  ) 

ITEM! LK ,L J= ITEM! LK- 1, L ) 

ITEM! LK-1  , L  )  =  I T  EM  P 
145  CONTINUE 


TEMP- STORE! LK) 

STORE (L K) -STORE { LK- 1  } 

STORE ( LK- 1 ) -TEMP 
LK-LK-i 

IF ! LK .ME. I )  GOTO  132 
150  IF(KK.EQ.'lOO)  GOTO  ICO 
152  KK-KK+1 
GCTO  100 

600  I F { NO .GT. 100}  NG-100 
DC  169  KK=1,N0 
WRITE (6, 162)  ST  ORE ! KK ) 

162  FORMAT! /5X,  'RES°=  '  , FI  5.7) 

M- ITEM ( KK , 1  ) 

WRITE! 6, 165) (ITEMtKK, L)  ,L=2,M) 
165  FORMAT ( IX, 33A4//) 

169  CONTINUE 

WRITE (6, 175) 

175  FORMAT! /5X,  »***  SEARCH  COMPLETED 
REWIND  MAG 
GOTO  1 


150 


555  STOP 
END 


i 
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C 

c 

THIS  ROUTINE  INVERSE  A  MATRIX  AND  COMPUTE 
THE  WEIGHTS  BY  CALLING  THE  LIBRARY  SUBROUTINE  CS011A 

SUBROUTINE  FI NVA ( N,  MAT , I NVERS, RHS , XX , BB , RR , P I  VOT ,  C  »  VI ,  V2 ) 

RFAL  VI (20) t V2 { 20 ) 

R  FAL*4  MAT ( N , N ) , I NVERS ( N, N ) , RHS ( N ) , XX { N ) , BB ( N  t  N )  ,R R ( N ) 

INTEGER  PIV0T(N),CI20,20) 

DO  601  1  =  1, N 
DO  600  J=1,N 
MAT (I , J)  =  C(  I, J) 

600  CONTINUE 

601  CONTINUE 

CALL  CS01 1  A ( MAT , I NVERS  »RHS, XX  *  Nt  BB, RR ,P I VOT ) 

WRITE (6, 6 02 ) ( ( I, J ? I NVERS ( ItJ)tJ=l»N)»I=l,N) 

602  FORMAT! /5 X, ’THE  FOLLOWING  ARE  ELEMENTS  OF  THE  INVERSE  OF  MATRIX 
UC(I,J)’ /(4( 5X,2I4,F16.8) ) ) 

no  101  i=i,  n 

VI ( 15=0. 

V2( I )=0. 

101  CONTINUE 

DO  10  5  1=1,  N 
DO  10  3  J=1,N 
V1(I)=V1(I)  -f- INVER  S  (  J  ?  I  ) 

V  2 ( I)=V2( I ) +1 NVERS { I , J ) 

103  CONTINUE 

WRITE (6, 104  )  VI (I)  , V2II  3 

104  FORMAT { /5X, 1 VI ( 1)='  ,F  16. 6,1 OX, ' V2 { I )=» , FI 6. 6) 

105  CONTINUE 
RETURN 
END 
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APPENDIX  B 


In  response  to  the  question  concerning  machine, 
engine ,  and  fan,  noise  the  matched  search  produced 
a  list  of  100  titles  arranged  in  descending  order  of 
relevance  as  indicated  by  R(N).  The  following  pages 
contain  a  portion  of  that  list. 
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❖  ❖  # 

TITILES 

RETRIEVED 

❖  * 

ACC 

FUN  )  = 

55  027 

0*6666667 
2 17 RERAN 

K  AMP  E 

ALLEN 

NOISE 

CENTR 

FANS 

ACC 

R  (  N  )  = 

56  02  8 

0  *  6666  667 
681NI EKE 

MEASU 

NOISE 

DUCTE 

FANS 

ACO 

R  IN)  = 

58  030 

0  *  6666667 
07 14 HO WES 
DUCTE 

REAL 

CENTR 

NOISE 

FANS 

ORIGI 

POWER 

SPECT 

ACO 

R  (  N  )  =  ' 

66  C39 

0.3562517 
907BI SHO 
SCHOO 

REDUC 

MOTEL 

AIRCR 

RESID 

NOISE 

ROOMS 

MEASU 

SEVER 

ACO 

R  (  N  )  = 

56  028 

0.2416164 

427PIETR 

ISLAN 

JET 

NOISE 

PROBL 

AIRCR 

CARR! 

ACO 

RIN)  = 

56  C28 

0.2416164 
434P I ETR 

NOISE 

MEASU 

SOME 

JET 

AIRCR 

ACC 

R  { N)  = 

57  029 

0.2416164 

0854KIRCH 

ACOUS 

GREEN 

NOISE 

NQNLI 

RESPO 

AIRCR 

PANEL 

ACU 

R  (N)  = 

63  13 

0.2416164 
324R0BIN 
FL  IGH 

BOWSH 

COREL 

JUOG I 

NOISE 

AIRCR 

SOV 

RCN)  = 

65  11 

0.2416164 
170MELNI 
LA  NO  I 

NOISE 

TU124 

GENER 

PASSE 

GROUN 

AIRCR 

DUR I  N 

TAKEO 

ACU 

R  {  N )  = 

65  16 

0.2416164 

251KOPPE 

AIRCR 

MAT  SC 
NOISE 

MULLE 

VI Cl  N 

PROCE 

AIRPO 

DESCR 

ASSES 

ACO 

R  {  N  3  = 

66  039 

0.2416164 
138 WILL I 

KRYTE 

MASKI 

SPEEC 

AIRCR 

NOISE 

ACO 

R(N)  = 

66  040 

0.2416164 
108BI SHO 
NOISE 

JUDGM 

RELAT 

ABSOL 

ACCEP 

AIRCR 

ACO 

R  C  N )  — 

57  029 

0.2369385 
12  89  WARD 

HEAR  I 

NAVAL 

AIRCR 

MAI  NT 

PERSO 

ACO 

R  (  N  )  = 

59  031 

0.2369385 

1 4 15KRYTE 

SCALI 

HUMAN 

REACT 

SOUND 

AIRCR 

ACU 

R  (  N )  = 

66  17 

0.2369385 

245B0WSH 

JOHNS 

ROBIN 

FURTH 

EXPER 

JUDGI 

.  ■  ^  T  ^  r-  ■  1  <  ■ 


'  , 

ACC 

61 

033 

1  553LOEB 
TEMPO 
ACTIO 

FLETC 

THRES 

CONTR 

SHIFT 

THRES 

INDIC 

SHIFT 

ACOUS 

REDUC 

REFLE 

ACC 

R(N) 

59 

030 

0 » 1367  636 
0533JILLI 
SOUND 
SUB  J  E 

FI  ALA 
LOUDS 
JET 

METHO 

ENVIR 

MI  SSI 

GENER 
TEST  I 
ENGIN 

HIGH 

ELECT 

NOISE 

INTEN 

COMPO 

ACO 

R  ( N ) 
59 

031 

0.13 67 636 
0620MAGL1 
NOISE 

SHIEL 

SUPPR 

FLAP 

TYPE 

JET 

ENGIN 

ACO 

RI  N) 

59 

C31 

0. 1367636 
1013MA YES 
MEASU 
PRESS 

SOME 

ROCKE 

RATIO 

NEAR 

ENGIN 

FAR 

OPERA 

FIELD 

DIFFE 

NOISE 

NOZZL 

SOV 

R  IN) 
65 

10 

0.1367636 
276MELNI 
TU 124 

EFFEC 

TRANS 

FLIGH 

VELOC 

ENGIN 

NOISE 

ACO 

RiN) 

55 

027 

0.  1320857 
449 WATTE 
ENG  IN 

LA8AT 

TEST 

BE  RAN 
CELL 

ACOUS 

STRUC 

BEHAV 

SOME 

ACC 

R  { N ) 

56 

02  8 

0.1320857 

683WATER 

JET 

COOK 
ENGI  N 

BEREN 

TEST 

MEASU 

CELLS 

SOUND 

ABSOR 

SOV 

R(N) 

56 

002 

0.1320857 

335R0ZEN 

SOUND 

HUTER 

ULTRA 

BOLT 

ENGIN 

SONIC 

SCIEN 

TECHN 

USE 

SOV 

R  ( N ) 
56 

00  2 

0.1320857 

337CRAWF 

ULTRA 

ENGI  N 

SOV 

R  C ISI ) 
57 

003 

0.1320857 

91KRASI 

BERG  M 

ULTRA 

APPLI 

SCIEN 

ENGIN 

ACC 

R  (  N ) 
58 

030 

0.1320857 

0378HUETE 

SONIC 

ENGIN 

ACO 

RI  N) 
59 

031 

0. 1320357 
0065KAMPS 
PRESS 
ENG  IN 

ST  AT  I 
GENER 

EVALU 

EXHAU 

NEAR 

HIGH 

FI  ELD 
PERFO 

SOUND 

JET 

ACC 

R  (  N) 
66 

036 

0.1320857 
2027TEDR I 

S  I  NGL 

ACOUS 

NOZZL 

MEASU 

ROCKE 

STSTI 
ENGI  N 

TESTS 

CLUDT 

ACO 

R(N) 

65 

037 

0.1320857 

0  644 HARR  I 
RISK 

HEAR  I 
CRITE 

LOSS 

DIESE 

TREND 

ENGIN 

CURVE 

PERSO 

DA  MAG 

SOV 

R  (N) 
65 

10 

0.  1320857 
423CHUGU 
ENGI  N 

CONE  E 
PROCE 

APPL  I 

ULTRA 

I  NT  E  N 

CHE  MI 

ACG 


RIN)- 

67 


62 


0  *  1320  85  7 
1244WANG 


ACOUS 


ANALY 


MULTI 


CYLIN 


ENG  IN 


1 

- 

. 

■ 


■ 

• 
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APPENDIX  C 

From  equation  (2,6) f  dif f erentiating  with  respect  to 
w.  yields 

l 

2  x.(E  w.  x  )  [£  Z  c.  w  w  ]  -  ( £  w.  x.  )2[E  c.  .w.+  £  c.  .w.] 
i  1  1  j  j  ]  ]  i  i  1  -j  3-3  3 


( E  E  c  .  .  w  .  w  .  ) 

i  j  id  r  y 


(A.l) 


Since  c . .  =  c..  and  the  summation  over  i  is  identical 

J  J 

to  the  summation  over  j,  it  follows  that  equation  (A.l)  may 
be  simplified  to 


2  x . ( E  w.  x . ) [ E  E  c..  w.  w . ] 

i  .  l  l  .  .  i]  l  -] 

l  l  j  J  J 


2(Ew.  x.)  Ec..w. 
ii  i]  l 

l  J 


=  0. 


(A.  2) 


Defining  X  to  be  the  following: 

2 


X  = 


( E  w  .  x  .  ) 
l  l 
l _ 

E  E  c .  .  w .  w . 

•  •  il  i  D 

i  1  J  J 


and  substituting  into  equation  (A. 2),  equation  (2.7)  results 


I 


■ 


<  i  '  -  jt  S  ||  9  1  I  I  ^ 
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APPENDIX  D 

Since  C  =  D  +  N  (D.l) 

C"1  =  (D  +  N) ~1 

=  [D (I  +  D~" 1  N) ] _1 

=  (I  +  D~ 1  N) ~ 1  D~ 1  .  (D . 2 ) 

Substituting  equation  (D.2)  into  equation  (2.13),  equation 
(2.19)  is  obtained  which  appears  as  follows: 

r  =  XT  C"1  z 

=  X  (I  +  D_1  N)"1  D_1  Z  . 


(D.3) 


